Abstract

We present several results on the mixing time of the Glauber dynamics for 
sampling from the Gibbs distribution in the ferromagnetic Potts model. At a 
fixed temperature and interaction strength, we study the interplay between the 
maximum degree ($\Delta$) of the underlying graph and the number of colours or spins
($q$) in determining whether the dynamics mixes rapidly or not. We find a lower
bound $L$ on the number of colours such that Glauber dynamics is rapidly mixing
if at least $L$ colours are used. We give a closely-matching upper bound $U$ on
the number of colours such that with probability that tends to 1, the Glauber
dynamics mixes slowly on random $\Delta$-regular graphs when at most $U$ colours are
used. We show that our bounds can be improved if we restrict attention to certain
types of graphs of maximum degree $\Delta$, e.g. toroidal grids for $\Delta = 4$.



1 Introduction 

The Potts model was introduced in 1952 [23] as a generalisation of the Ising model of
magnetism. The Potts model has been extensively studied not only in statistical physics, 
but also in computer science, mathematics and further afield. In physics the main
interest is in studying phase transitions and modelling the evolution of non-equilibrium
particle systems; see [30] for a survey. In computer science, the Potts model is a test-bed 
for approximation algorithms and techniques. It has also been heavily studied in the 
areas of discrete mathematics and graph theory, through an equivalence to the Tutte 
polynomial of a graph [29], and thereby links to the chromatic polynomial and many 
other graph invariants. The Potts model and its extensions have also appeared many 
times in the social sciences, for example in modelling financial markets [28] and voter 
interaction in social networks [5], and in biology [13].

In graph-theoretic language, the Potts model assigns a weight to each possible colouring
of a graph (not necessarily proper), and we are interested in sampling from the
distribution induced by the weights. The main obstacle to sampling is that the appropriate
normalisation factor, the sum of the weights of all colourings, is hard to compute. To
be precise: for a graph $G = (V, E)$, a configuration $\sigma$ is a function which assigns to each
vertex one of $q$ possible colours (also called states or spins). The probability of finding
the system in a given configuration $\sigma$ is given by the Gibbs distribution:

\[ \pi(\sigma) = Z^{-1} \exp\Big( \beta \sum_{(i,j) \in E} J \, \delta(\sigma_i, \sigma_j) \Big), \]

where $\delta(\sigma_i, \sigma_j)$ is the Kronecker-$\delta$ (taking value 1 if the colours $\sigma_i$ and $\sigma_j$, assigned to
vertices $i$ and $j$ respectively, are the same, and taking value 0 otherwise), $\beta = (kT)^{-1} > 0$
is the inverse temperature ($k$ is Boltzmann's constant and $T$ is temperature), and $Z =
Z(G, \beta, J)$, the partition function, is the appropriate normalisation factor to make this a
probability distribution. The strength of the interaction between neighbouring vertices
is given by the coupling constant $J$. If $J > 0$ then the bias is towards having many
edges with like colours at the endpoints; this is the ferromagnetic region. Conversely,
if $J < 0$ then the bias is towards few edges with like colours at the endpoints: the
anti-ferromagnetic region. In our treatment we will regard $e^{\beta J}$ as a single parameter $\lambda$,
which we will call the fugacity since this combined parameter plays the same role as the
fugacity in lattice gas models. Setting $\mu(\sigma)$ to be the number of monochromatic edges
in a colouring $\sigma$ (that is, $\mu(\sigma) = \sum_{(i,j) \in E} \delta(\sigma_i, \sigma_j)$), we obtain the equivalent formula

\[ Z(G, \lambda) = \sum_{\sigma} \lambda^{\mu(\sigma)}. \]

When $q = 1$ the evaluation of the partition function is trivial. In almost all other
cases it is #P-hard to compute the partition function exactly, and thus there can be no
efficient algorithm (running in time polynomial in the size of the underlying graph) if
P $\neq$ NP [16]. Therefore attention has been focused on approximation algorithms. The
specific question is: for what classes of graphs and what ranges of $q$ and $\lambda$ is there a fully
polynomial randomised approximation scheme (FPRAS) for the partition function? In
the anti-ferromagnetic case, $\lambda < 1$, there can be no FPRAS for the partition function
unless NP = RP [12], except when $q = 1$ and at a few special points. For the ferromagnetic
region, $\lambda > 1$, there is only known to be an FPRAS when $q = 2$ (the Ising model)
for general graphs at any temperature [19]. There is also an FPRAS for the entire
ferromagnetic region (no restriction on $q$) if we restrict the underlying graphs to the class
of dense graphs (those having minimum degree $\Omega(n)$ p], or having edge connectivity at
least $\Omega(\log n)$ [21]). In terms of approximation complexity, approximating the partition
function of the ferromagnetic Potts model is equivalent to #BIS - the problem of
approximating the number of independent sets in a bipartite graph [11]. This puts it in an
interesting class of approximation problems not known to be hard, but none of which
have been shown to exhibit an FPRAS [6].

A standard approach to approximating the partition function is to simulate Glauber 
dynamics. In Glauber dynamics the following process is iterated (starting from any given 
configuration): a random vertex updates its colouring by selecting a colour according to 
the local Gibbs distribution induced by the current colourings of its neighbours. (This 
will be formalised in the next subsection.) The distribution on configurations obtained 
after t steps of Glauber dynamics converges to an equilibrium given by the global Gibbs 
distribution on the whole graph, as t goes to infinity. The approximation is achieved by 
simulating the Glauber dynamics for long enough to generate a sample that is distributed 
with very nearly the equilibrium distribution. This process is Markov chain Monte Carlo 
sampling (MCMC) [20]. The close link between sampling and approximate counting
means that if Glauber dynamics gets sufficiently close to equilibrium in polynomial time 
(in the size of the graph) then there is an FPRAS for the partition function. In this 
case the dynamics is said to mix rapidly. Physicists' understanding of phase transitions
indicates that at sufficiently high temperature (all other things being equal) Glauber
dynamics will mix rapidly, whereas at sufficiently low temperature Glauber dynamics 
will mix slowly [22]. However, determining the exact range of temperature in which 
Glauber dynamics mixes rapidly is, in general, open. 

In the anti-ferromagnetic case, where it is known that there can be no FPRAS in
general, the MCMC technique has still yielded many results approximating the partition
function for restricted classes of graph, notably bounded-degree graphs. In the
zero temperature limit of the anti-ferromagnetic Potts model only proper vertex colourings
have non-zero weight. Thus approximating the partition function is equivalent to
approximately counting proper $q$-colourings of the underlying graph. Jerrum [17] first
showed that, provided the number of colours is more than twice the maximum degree of
the graph, the Glauber dynamics mixes rapidly; this was also proved independently in the
physics community by Salas and Sokal [21]. This result has been followed by numerous
refinements gradually reducing the ratio of colours to degree required for rapid mixing:
see [10] for a recent survey. In this paper we shall investigate the interplay of the maximum
degree $\Delta$ of the graph $G$ and the number of colours $q$ in determining whether
the convergence of Glauber dynamics for the ferromagnetic Potts model is fast (rapid
mixing) or slow.

Acknowledgements: We are grateful to Ostap Hryniv and Gregory Markowsky for
leading us to the generalised Hölder's inequality (and to [9]) for Lemma 3.5. We are
also grateful to Mario Ullrich for his helpful comments.

1.1 Definitions 

Throughout we shall be concerned with discrete-time, reversible, ergodic Markov chains
with finite state space $\Omega$. Let $\mathcal{M}$ be such a Markov chain with transition matrix $P$ and
(unique) stationary distribution $\pi$. For $\varepsilon > 0$ and $x \in \Omega$, we define

\[ \tau_x(\mathcal{M}, \varepsilon) = \min\{ t : \| P^t(x, \cdot) - \pi(\cdot) \|_{TV} \le \varepsilon \}, \]

where $\| \cdot \|_{TV}$ denotes total variation distance between two distributions: that is,

\[ \| \phi - \phi' \|_{TV} := \frac{1}{2} \sum_{x \in \Omega} | \phi(x) - \phi'(x) | \]

for any two probability distributions $\phi$, $\phi'$ on $\Omega$. We define $\tau(\mathcal{M}, \varepsilon) = \max_x \tau_x(\mathcal{M}, \varepsilon)$.
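
The two definitions above can be checked numerically on very small chains. The following is a minimal sketch, assuming the chain is given as a dense row-stochastic matrix (a list of lists) and the stationary distribution as a list; the function names `total_variation` and `mixing_time` and the cut-off `t_max` are illustrative choices, not notation from this paper.

```python
def total_variation(phi, phi_prime):
    """Total variation distance: half the L1 distance between two distributions."""
    return 0.5 * sum(abs(a - b) for a, b in zip(phi, phi_prime))


def mixing_time(P, pi, eps, t_max=10_000):
    """Smallest t such that every row of P^t is within eps of pi in total variation.

    P is a row-stochastic matrix (list of lists) and pi its stationary
    distribution. t_max is an arbitrary safety cut-off for this sketch.
    """
    n = len(P)
    # rows[x] holds the distribution P^t(x, .); start from t = 0 (point masses).
    rows = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for t in range(t_max + 1):
        if max(total_variation(row, pi) for row in rows) <= eps:
            return t
        # Advance every starting distribution by one step of the chain.
        rows = [[sum(row[k] * P[k][j] for k in range(n)) for j in range(n)]
                for row in rows]
    raise ValueError("chain did not mix within t_max steps")
```

This brute-force computation is only feasible when the state space is tiny; for the Potts model the state space has size $q^n$, which is why bounds such as those proved below are needed.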

Let $G = (V, E)$ be a graph with $n := |V|$, and let $[q] = \{1, \ldots, q\}$ be a set of colours.
We write $\Omega = [q]^V$ for the set of (not necessarily proper) $q$-colourings of $G$. Fix a
constant $\lambda > 1$, which is called the fugacity. The Gibbs distribution $\pi = \pi(G, \lambda, q)$ on
$\Omega$ is given by

\[ \pi(\sigma) \propto \lambda^{\mu(\sigma)} \]

for all $\sigma \in \Omega$, where $\mu(\sigma)$ denotes the number of monochromatic edges of $G$ in the
colouring $\sigma$. More precisely, $\pi(\sigma) = \lambda^{\mu(\sigma)} / Z$, where $Z$ is the partition function

\[ Z = Z(G, \lambda, q) = \sum_{\sigma \in \Omega} \lambda^{\mu(\sigma)}. \]
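
For very small graphs the partition function and the Gibbs distribution can be computed by direct enumeration, which is useful for sanity-checking the definitions. The sketch below assumes vertices are labelled $0, \ldots, n-1$, colours are labelled $0, \ldots, q-1$, and edges are given as pairs; the function names are illustrative only. The running time is $q^n$, so this is not a practical algorithm.

```python
from itertools import product


def monochromatic_edges(edges, sigma):
    """mu(sigma): number of edges whose endpoints receive the same colour."""
    return sum(1 for i, j in edges if sigma[i] == sigma[j])


def partition_function(n, edges, lam, q):
    """Z(G, lambda, q): sum of lambda^mu(sigma) over all q^n colourings."""
    return sum(lam ** monochromatic_edges(edges, sigma)
               for sigma in product(range(q), repeat=n))


def gibbs_distribution(n, edges, lam, q):
    """Map each colouring sigma (a tuple) to its Gibbs probability."""
    z = partition_function(n, edges, lam, q)
    return {sigma: lam ** monochromatic_edges(edges, sigma) / z
            for sigma in product(range(q), repeat=n)}
```

For a single edge with $q = 2$ and $\lambda = 3$, the two monochromatic colourings have weight 3 and the two proper colourings have weight 1, so $Z = 8$.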

The Glauber dynamics is a very simple Markov chain on $\Omega$, with stationary distribution
given by the Gibbs distribution. Given a colouring $X \in \Omega$, a vertex $v \in V$, and
a colour $c \in [q]$, let $n(X, v, c)$ denote the number of neighbours of $v$ with colour $c$ in $X$.
Define the probability distribution $\phi^v_X$ on $[q]$ by

\[ \phi^v_X(c) \propto \lambda^{n(X, v, c)}. \]

The transition procedure of the Glauber dynamics from current state $X_t \in \Omega$ is as
follows:

• choose a vertex $v$ of $G$ uniformly at random;

• choose a colour $c \in [q]$ according to the distribution $\phi = \phi^v_{X_t}$;

• for each $u \in V$ let

\[ X_{t+1}(u) = \begin{cases} X_t(u) & \text{if } u \neq v, \\ c & \text{if } u = v. \end{cases} \]

Then $X_{t+1}$ is the new state. We write $\mathcal{M}_{GD} = \mathcal{M}_{GD}(G, \lambda, q)$ for the Glauber dynamics
as described above.
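
The transition procedure above can be sketched in a few lines. This is a minimal illustration, assuming the graph is given as an adjacency list `adj` (with `adj[v]` the list of neighbours of `v`), colours are labelled $0, \ldots, q-1$, and a `random.Random` instance is passed in for reproducibility; none of these names come from the paper.

```python
import random


def glauber_step(adj, colouring, lam, q, rng):
    """Perform one step of Glauber dynamics and return the new colouring."""
    # Choose a vertex v uniformly at random.
    v = rng.randrange(len(adj))
    # counts[c] = n(X, v, c), the number of neighbours of v with colour c.
    counts = [0] * q
    for w in adj[v]:
        counts[colouring[w]] += 1
    # The local distribution is proportional to lambda^{n(X, v, c)}.
    weights = [lam ** counts[c] for c in range(q)]
    c = rng.choices(range(q), weights=weights, k=1)[0]
    # Recolour v and leave every other vertex unchanged.
    new_colouring = list(colouring)
    new_colouring[v] = c
    return new_colouring
```

Repeatedly applying `glauber_step` from any starting colouring simulates the chain $\mathcal{M}_{GD}$; how many steps are needed before the output is close to the Gibbs distribution is exactly the mixing-time question studied in this paper.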

We say that $\mathcal{M}_{GD}$ mixes rapidly if $\tau(\mathcal{M}_{GD}, \varepsilon)$ is polynomial in $\log |\Omega|$, that is,
polynomial in $n$. If $\tau(\mathcal{M}_{GD}, \varepsilon)$ is exponential in $n$, then we say that $\mathcal{M}_{GD}$ mixes slowly.

1.2 Results



Our main results are stated below. In order to keep the presentation simple at this 
stage, we sometimes postpone giving the explicit relationships amongst constants and 
mixing times until later, but in each case, we direct the reader to where a more detailed 
statement can be found. 

In Theorem 1.1 we present our first, and simplest, bound on the number of colours,
as a function of $\Delta$ and $\lambda$, that guarantees rapid mixing of Glauber dynamics. Although
Theorem 1.1 follows from a standard coupling argument, for completeness we prove it
here, as we will need this result later to establish our improved bounds.

Theorem 1.1. Let $\Delta, q \ge 2$ be integers and take $\lambda > 1$ such that $q \ge \Delta \lambda^{\Delta} + 1$. Then
the Glauber dynamics of the $q$-state Potts model at fugacity $\lambda$ mixes rapidly for the class
of graphs of maximum degree $\Delta$.

Theorem 1.1 will be proved in Section 2.2; see Proposition 2.2 for a more detailed
statement.

There is some overlap between Theorem 1.1 and a result of Hayes [15, Proposition
14] for $q = 2$, which was generalised to arbitrary $q$ by Ullrich [27, Corollary 2.14]. Ullrich
showed that when the inverse temperature $\beta$ satisfies $\beta \le 2c/\Delta$ for some $0 < c < 1$,
then the Glauber dynamics is rapidly mixing, with mixing time bounded above by
$(1 - c)^{-1} n \log(n \varepsilon^{-1})$. Since $\lambda = e^{\beta}$, the lower bound on $q$ in the statement of Theorem 1.1
can be expressed as

\[ \beta \le \frac{1}{\Delta} \log\Big( \frac{q - 1}{\Delta} \Big). \]

Hence our result holds for a wider range of $\beta$ when $q > e^2 \Delta + 1$, while if $q < \Delta + 1$ then
Theorem 1.1 does not apply but [27, Corollary 2.14] is valid. For intermediate $q$, both
results apply.
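
The comparison between the two ranges of $\beta$ can be made concrete numerically. The sketch below rearranges the hypothesis $q \ge \Delta \lambda^{\Delta} + 1$ of Theorem 1.1 with $\lambda = e^{\beta}$ to obtain a threshold on $\beta$, and compares it with the supremum $2/\Delta$ of the range $\beta \le 2c/\Delta$, $0 < c < 1$. The function names are illustrative and do not appear in the cited works.

```python
import math


def beta_threshold_theorem_1_1(q, delta):
    """Largest beta allowed by q >= delta * exp(beta * delta) + 1.

    Returns None when no beta > 0 satisfies the condition, which happens
    exactly when q <= delta + 1.
    """
    if q <= delta + 1:
        return None
    return math.log((q - 1) / delta) / delta


def beta_supremum_small_beta_regime(delta):
    """Supremum of the range beta <= 2c/delta over 0 < c < 1."""
    return 2.0 / delta
```

For $\Delta = 4$ the crossover $e^2 \Delta + 1$ is roughly 30.6: with $q = 30$ the small-$\beta$ condition covers more values of $\beta$, while with $q = 31$ the condition of Theorem 1.1 does.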

In Theorem 1.2 we improve the exponent of $\lambda$ in the bound, but at the expense of a
larger constant. We also show that the exponent achieved is close to the best possible,
by proving a corresponding slow-mixing bound for almost all regular graphs of degree
$\Delta$.

Theorem 1.2. Fix an integer $\Delta \ge 2$. For any $\eta \in (0, 1)$ there are constants $c_1$ and $c_2$
(depending on $\eta$ and $\Delta$), such that for any integer $q \ge 2$ and any $\lambda > 1$

(i) if $q \ge c_1 \lambda^{\Delta - 1 + \eta}$ then the Glauber dynamics of the $q$-state Potts model at fugacity
$\lambda$ mixes rapidly for the class of connected graphs of maximum degree $\Delta$;

(ii) if $q \le c_2 \lambda^{\Delta - 1 - \frac{1}{\Delta - 1} - \eta}$ then the Glauber dynamics of the $q$-state Potts model at
fugacity $\lambda$ mixes slowly for almost all regular graphs of degree $\Delta \ge 3$.

Theorem 1.2 is proved at the end of the paper: a more detailed statement of
Theorem 1.2(i) can be found in Theorem 2.14, while a more detailed statement of
Theorem 1.2(ii) can be found in Theorem 4.4.

Theorem 1.2(ii) is proved using a conductance argument. It turns out that conductance
for the Glauber dynamics is related to the expansion properties of the underlying
graph, and so we prove that almost all $\Delta$-regular graphs have the relevant property.
This argument alone gives a worse bound than that in Theorem 1.2(ii), but combined
with the solution of an interesting extremal problem (proved in Section 3), which we
believe may be of independent interest, we are able to obtain the required improvement.

Theorem 1.2(i) is proved by first using a coupling argument to prove a rapid-mixing
result for block dynamics (a more general form of dynamics than Glauber dynamics)
and then using a Markov chain comparison argument to obtain rapid mixing for Glauber
dynamics. In proving Theorem 1.2(i), we derive a general combinatorial condition on
graphs that guarantees rapid mixing of Glauber dynamics (Theorem 2.4 combined with
Corollary 2.13). This condition can be used to improve the bounds of Theorem 1.2(i)
for graph classes of maximum degree $\Delta$ with "low expansion". We illustrate this in
Theorem 1.3 below with the example of the toroidal grid.

Theorem 1.3. For any $\eta \in (0, 1)$ there are constants $c_3$, $c_4$ and $c_5$ (depending on $\eta$),
such that for any positive integer $q$ and any $\lambda > 1$

(i) if $q \ge c_3 \lambda^{3 + \eta}$ then the Glauber dynamics of the $q$-state Potts model at fugacity $\lambda$
mixes rapidly for the class of connected graphs of maximum degree 4;

(ii) if $q \ge c_4 \lambda^{2 + \eta}$ then the Glauber dynamics of the $q$-state Potts model at fugacity $\lambda$
mixes rapidly for the toroidal grid;

(iii) if $q \le c_5 \lambda^{8/3 - \eta}$ then the Glauber dynamics of the $q$-state Potts model at fugacity $\lambda$
mixes slowly for almost all regular graphs of degree 4.

In particular, for sufficiently large $\lambda$ there is a positive integer $q$ such that the Glauber
dynamics of the $q$-state Potts model at fugacity $\lambda$ mixes rapidly for the toroidal grid, but
slowly for almost all regular graphs of degree 4.

The purpose of Theorem 1.3 is illustrative and it is proved at the end of the paper.
Theorem 1.3(i) and (iii) are immediate consequences of Theorem 1.2 (by substituting
$\Delta = 4$), while Theorem 1.3(ii) is a useful illustration of our general technique applied to
the grid. A more detailed statement of Theorem 1.3(ii) is given as part of Theorem 2.15.

We note that much sharper bounds for the grid are known than are presented here.
It is known that for the infinite 2-dimensional grid, the phase transition occurs at $q =
(\lambda - 1)^2$ [30], and that rapid mixing occurs for finite grids when $q > (\lambda - 1)^2 = O(\lambda^2)$ [22]
and Theorem 2.10 of [27]. Corresponding slow-mixing results for the grid also hold; see
[3] and Remark 2.11 of [27].

Section 2 contains our results on rapid mixing of Glauber dynamics. Section 3 is
devoted to an extremal problem whose solution allows us to obtain improved bounds
for our slow-mixing results in Section 4.

2 Mixing time upper bounds



Our goal in this section is to give good lower bounds on the number of colours needed for
the Glauber dynamics to mix rapidly. We begin by describing the notions of coupling
and path coupling, which are very useful tools in proving upper bounds on mixing times
for Markov chains. In Section 2.2 we apply path coupling directly to the Glauber
dynamics of bounded-degree graphs to obtain our first lower bound on the number of
colours needed for rapid mixing. In Section 2.3 we consider block dynamics, a more
general type of dynamics that can be used to sample from the Gibbs distribution. We
give a general lower bound on the number of colours needed for rapid mixing of block
dynamics (Theorem 2.3). We illustrate how to apply Theorem 2.3 to bounded-degree
graphs in Section 2.4. In Section 2.5 we relate the mixing time of Glauber dynamics
to that of the block dynamics and show how this gives various improvements to the
bounds obtained in Section 2.2. This enables us, in Theorems 2.14 and 2.15, to prove
what is needed for Theorem 1.2 part (i), and Theorem 1.3 parts (i) and (ii). Note that
the final proofs of Theorems 1.2 and 1.3 are left until we have all the pieces, at the end
of Section 4.

2.1 Coupling 

The notion of coupling (more specifically path coupling [4]) lies at the heart of our proofs
of upper bounds for mixing times. We give the basic setup in this section.

Let $\mathcal{M} = (X_t)$ be a Markov chain with transition matrix $P$. A coupling for $\mathcal{M}$
is a stochastic process $(A_t, B_t)$ on $\Omega \times \Omega$ such that each of $(A_t)$ and $(B_t)$, considered
independently, is a faithful copy of $(X_t)$. Since all our processes are time-homogeneous,
a coupling is determined by its transition matrix: given elements $(a, b)$ and $(a', b')$ of
$\Omega \times \Omega$, let $P'((a, b), (a', b'))$ be the probability that $(A_{t+1}, B_{t+1}) = (a', b')$ given that
$(A_t, B_t) = (a, b)$. Since $(A_t, B_t)$ is a coupling, for each fixed $(a, b) \in \Omega \times \Omega$, we have

\[ \sum_{b' \in \Omega} P'((a, b), (a', b')) = P(a, a') \quad \text{for all } a' \in \Omega; \]

\[ \sum_{a' \in \Omega} P'((a, b), (a', b')) = P(b, b') \quad \text{for all } b' \in \Omega. \]

Under path coupling, the coupling is only defined on a subset $\Lambda$ of $\Omega \times \Omega$. This restricted
coupling is then extended to a coupling on the whole of $\Omega \times \Omega$ along paths in the state
space $\Omega$. In our setting, we have $\Omega = [q]^V$, where $V$ is the vertex set of some fixed graph.
For $a, a' \in \Omega$, we write $d(a, a')$ for the number of vertices on which $a$ and $a'$ differ in
colour (that is, the Hamming distance). Define $\Lambda \subseteq \Omega \times \Omega$ by

\[ \Lambda = \{ (a, a') : d(a, a') = 1 \}. \]

The key property of $\Lambda$ required for the path coupling method is that for any $a, a' \in \Omega$,
by recolouring the $d(a, a')$ disagreeing vertices one by one in an arbitrary order, we obtain a
path of length $d(a, a')$ from $a$ to $a'$, with consecutive elements of the path corresponding
to an element of $\Lambda$.

Lemma 2.1 (See [8] for example). Let $\Omega = [q]^V$ and $\Lambda$ be as above, with $n := |V|$, and let
$\mathcal{M}$ be some Markov chain on $\Omega$. Suppose that we can define a coupling $(A, B) \mapsto (A', B')$
for $\mathcal{M}$ on $\Lambda$ such that for some constant $\beta < 1$ and all $(A, B) \in \Lambda$ we have

\[ \mathbb{E}(d(A', B') \mid (A, B)) \le \beta. \]

Then by path coupling we may conclude that

\[ \tau(\mathcal{M}, \varepsilon) \le \frac{\log(n \varepsilon^{-1})}{1 - \beta}. \]
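
As a quick illustration of how a contraction estimate turns into a mixing-time bound, the sketch below evaluates the standard path coupling bound $\log(n \varepsilon^{-1}) / (1 - \beta)$ and specialises it to a contraction factor of the form $\beta = 1 - 1/((\Delta + 1) n)$, which is the shape that arises for single-site dynamics in the next subsection. The function names are illustrative only.

```python
import math


def path_coupling_bound(n, beta, eps):
    """Upper bound log(n / eps) / (1 - beta) on the mixing time.

    n is the number of vertices (the maximum Hamming distance) and beta < 1
    is the one-step contraction factor on pairs at distance 1.
    """
    if not 0.0 <= beta < 1.0:
        raise ValueError("beta must lie in [0, 1)")
    return math.log(n / eps) / (1.0 - beta)


def single_site_bound(n, delta, eps):
    """The bound obtained when beta = 1 - 1 / ((delta + 1) * n)."""
    return (delta + 1) * n * math.log(n / eps)
```

With $n = 10$, $\Delta = 4$ and $\varepsilon = 1/4$ both functions give $50 \log 40$, roughly 184 steps.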

2.2 Glauber dynamics 

Our goal in this subsection is to prove Theorem 1.1. In the subsections that follow, we
shall see how we can improve Proposition 2.2 in some special cases, but in Section 4 we
shall see that the bound given below is close to best possible, at least in terms of the
exponent of $\lambda$.

We actually prove the following proposition, which immediately implies Theorem 1.1
but also provides a bound on the mixing time. The proof is a standard coupling
calculation.

Proposition 2.2. Let $G$ be a graph with maximum degree $\Delta$, and fix a fugacity $\lambda >
1$. Suppose that $q$ is an integer which satisfies $q \ge \Delta \lambda^{\Delta} + 1$. Recall that $\mathcal{M}_{GD} =
\mathcal{M}_{GD}(G, \lambda, q)$ denotes the Glauber dynamics for the $q$-state Potts model on $G$ with
fugacity $\lambda$. Then

\[ \tau(\mathcal{M}_{GD}, \varepsilon) \le (\Delta + 1) n \log(n \varepsilon^{-1}). \]

Proof. Fix $(A, B) \in \Lambda$ and let $u$ be the (unique) vertex which is coloured differently
by $A$ and $B$. We define a coupling $(A, B) \mapsto (A', B')$ as follows: choose a vertex $v$ of
$G$ uniformly at random, and obtain $A'$ (respectively, $B'$) by updating the colour of the
vertex $v$ in $A$ (respectively, $B$) according to the distributions $\phi_A := \phi^v_A$ (respectively,
$\phi_B := \phi^v_B$). The joint distribution on $(\phi_A, \phi_B)$ is chosen so as to maximise the probability
that $A'(v) = B'(v)$. Call this maximised probability $p = p(v, A, B)$. It is not hard to
see that

\[ 1 - p = \frac{1}{2} \sum_{c \in [q]} | \phi_A(c) - \phi_B(c) | = \| \phi_A - \phi_B \|_{TV}. \]

Observe that $p(v, A, B) = 1$ if $v = u$ or if $v$ is not a neighbour of $u$ (because in both
cases, $A$ and $B$ assign the same colours to the neighbours of $v$ and so $\phi_A$ and $\phi_B$ are
the same distribution).

Now assume that $v$ is a neighbour of $u$, so that $\phi_A$ and $\phi_B$ are different distributions.
Without loss of generality, we may assume that $A(u) = 1$ and $B(u) = 2$. Let $a_i :=
n(A, v, i)$, that is, $a_i$ is the number of neighbours of $v$ coloured $i$ by $A$. Similarly, let
$b_i := n(B, v, i)$. Note that $b_1 = a_1 - 1$, $b_2 = a_2 + 1$ and $b_i = a_i$ for $i = 3, \ldots, q$. Define

\[ Z_A = \sum_{i=1}^{q} \lambda^{a_i} \quad \text{and} \quad Z_B = \sum_{i=1}^{q} \lambda^{b_i} = Z_A + (1 - \lambda^{-1})(\lambda^{a_2 + 1} - \lambda^{a_1}), \]

and assume without loss of generality that $Z_B \le Z_A$. It is easy to see that $\phi_A(i) \le \phi_B(i)$
for $i = 2, \ldots, q$ and hence $\phi_A(1) \ge \phi_B(1)$. Thus

\[ \| \phi_A - \phi_B \|_{TV} = \max_{R \subseteq [q]} | \phi_A(R) - \phi_B(R) | = | \phi_A(1) - \phi_B(1) | = \frac{\lambda^{a_1}}{Z_A} - \frac{\lambda^{a_1 - 1}}{Z_B}. \]

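Given $a = (a_1, \ldots, a_q) \in [\Delta]^q$, define $f(a, \lambda, q) = \frac{\lambda^{a_1}}{Z_A} - \frac{\lambda^{a_1 - 1}}{Z_B}$, and let $g(\lambda, q)$ be the
maximum of $f$ over all $a \in [\Delta]^q$ subject to $a_1 + \cdots + a_q = \Delta$.
Observe that

\[ \mathbb{E}(d(A', B') - 1 \mid (A, B)) = (-1) \, \mathbb{P}(v = u) + \sum_{w : \{u, w\} \in E} \mathbb{P}(v = w) \, (1 - p(w, A, B)) \le -\frac{1}{n} + \frac{\Delta}{n} \, g(\lambda, q). \]

We give an easy upper bound for $g(\lambda, q)$ as follows. First, since $Z_B \le Z_A$, for all $a \in [\Delta]^q$ we have

\[ f(a, \lambda, q) \le \frac{\lambda^{a_1} - \lambda^{a_1 - 1}}{Z_A}. \]

The right hand side of the above is increasing in all directions of the form $e_1 - e_i$, where
$e_1, \ldots, e_q$ is the standard basis for $\mathbb{R}^q$. Therefore the right hand side is maximised when
$a = (\Delta, 0, \ldots, 0)$ giving

\[ g(\lambda, q) \le \frac{\lambda^{\Delta} - \lambda^{\Delta - 1}}{\lambda^{\Delta} + q - 1} \le \frac{1}{\Delta + 1}, \]

using the lower bound on $q$ to obtain the final inequality. Therefore

\[ \mathbb{E}(d(A', B') \mid (A, B)) \le 1 + \frac{1}{n} \Big( -1 + \frac{\Delta}{\Delta + 1} \Big) = 1 - \frac{1}{(\Delta + 1) n}. \]

Applying Lemma 2.1 completes the proof. □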
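
The key quantity in the proof is the total variation distance between the two local update distributions at a neighbour of the disagreeing vertex. For small parameters this can be checked by exhaustive search. The sketch below is an illustration only: it enumerates all colour-count vectors at a vertex of degree at most $\Delta$ in which colour 1 appears at least once (index 0 in the code), moves one neighbour from colour 1 to colour 2, and records the largest total variation distance seen. The function names are hypothetical.

```python
from itertools import product


def local_distribution(counts, lam):
    """Distribution on colours proportional to lambda^(count of that colour)."""
    weights = [lam ** a for a in counts]
    z = sum(weights)
    return [w / z for w in weights]


def worst_case_tv(delta, lam, q):
    """Largest TV distance between local distributions that differ in one neighbour.

    Exhaustive over count vectors a with a[0] >= 1 and sum(a) <= delta; the
    second vector b is a with one neighbour moved from colour 0 to colour 1.
    """
    worst = 0.0
    for a in product(range(delta + 1), repeat=q):
        if a[0] < 1 or sum(a) > delta:
            continue
        b = list(a)
        b[0] -= 1
        b[1] += 1
        phi_a = local_distribution(a, lam)
        phi_b = local_distribution(b, lam)
        tv = 0.5 * sum(abs(x - y) for x, y in zip(phi_a, phi_b))
        worst = max(worst, tv)
    return worst
```

When $q \ge \Delta \lambda^{\Delta} + 1$ the value returned should be at most $1/(\Delta + 1)$, in line with the bound on $g(\lambda, q)$ used in the proof; for example $\Delta = 2$, $\lambda = 1.2$, $q = 4$ satisfies the hypothesis.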



2.3 Block dynamics 

In this section we begin the analysis of block dynamics in which, at each step, the colours
of several vertices (or a block of vertices) are updated. We first present the framework
and show general results on block dynamics. In the next subsection we discuss suitable
choices of blocks and, in Theorem 2.7, show rapid mixing of block dynamics for certain
block systems.

As before, let $G = (V, E)$ be a graph, fix $\lambda > 1$ and let $\Omega = [q]^V$, where $[q] =
\{1, \ldots, q\}$. Let $\mathcal{S} = \{S_1, \ldots, S_R\}$ be a collection of subsets of $V$ such that $\bigcup_{S \in \mathcal{S}} S = V$.
Each element of $\mathcal{S}$ is called a block, and we call $\mathcal{S}$ a block system for $G$. Fix a probability
distribution $\psi$ on $\mathcal{S}$. We define a Markov chain $\mathcal{M}_{BD} = \mathcal{M}^{\mathcal{S},\psi}_{BD}(G, \lambda, q)$ with state space
$\Omega$, which we call the $(\mathcal{S}, \psi)$-block dynamics. We ensure that the new chain also has the
Gibbs distribution as its stationary distribution. First we need some more notation.

Given $S \in \mathcal{S}$, for $c \in [q]^S$ and $X \in \Omega$ we let $X^{(S,c)} \in \Omega$ be the colouring defined by

\[ X^{(S,c)}(u) = \begin{cases} X(u) & \text{if } u \notin S, \\ c(u) & \text{if } u \in S. \end{cases} \]

Let $\mu_{X,S}(c)$ denote the number of monochromatic edges in $X^{(S,c)}$ which are incident with
at least one vertex of $S$. Finally, define the distribution $\phi_{X,S}$ on $[q]^S$ by

\[ \phi_{X,S}(c) \propto \lambda^{\mu_{X,S}(c)}, \quad \text{that is,} \quad \phi_{X,S}(c) = \frac{\lambda^{\mu_{X,S}(c)}}{Z_{X,S}}, \]

where

\[ Z_{X,S} = \sum_{c \in [q]^S} \lambda^{\mu_{X,S}(c)}. \]



The transition procedure of the $(\mathcal{S}, \psi)$-block dynamics can now be described. From
current state $X_t \in \Omega$, obtain the new state $X_{t+1} \in \Omega$ as follows:

• choose $S \in \mathcal{S}$ according to the distribution $\psi$;

• choose a colouring $c \in [q]^S$ for $S$ from the distribution $\phi_{X_t, S}$;

• let $X_{t+1} = X_t^{(S,c)}$.

The stationary distribution of this chain is the Gibbs distribution on $\Omega$.
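
The block update can be sketched by enumerating all $q^{|S|}$ colourings of the chosen block, which is feasible only for small blocks. The following is a minimal illustration, assuming edges are given as pairs of vertex labels $0, \ldots, n-1$, blocks as tuples of vertices, and `psi` as a list of block probabilities in the same order as `blocks`; the function name and argument layout are illustrative choices.

```python
import random
from itertools import product


def block_step(edges, colouring, blocks, psi, lam, q, rng):
    """Perform one step of block dynamics and return the new colouring."""
    # Choose a block S according to the distribution psi.
    S = rng.choices(blocks, weights=psi, k=1)[0]
    in_S = set(S)
    # Only edges with at least one endpoint in S contribute to mu_{X,S}(c).
    touching = [(i, j) for i, j in edges if i in in_S or j in in_S]
    candidates, weights = [], []
    for c in product(range(q), repeat=len(S)):
        new_colouring = list(colouring)
        for u, colour in zip(S, c):
            new_colouring[u] = colour
        mu = sum(1 for i, j in touching if new_colouring[i] == new_colouring[j])
        candidates.append(new_colouring)
        weights.append(lam ** mu)
    # Sample the recoloured state with probability proportional to lambda^mu.
    return rng.choices(candidates, weights=weights, k=1)[0]
```

When every block is a single vertex and `psi` is uniform, this reduces to the single-site Glauber update.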

Theorem 2.3 below gives a sufficient condition on the number of colours for the $(\mathcal{S}, \psi)$-block
dynamics to be rapidly mixing. The result is stated in terms of three parameters
which we now define.

For $S \subseteq V$, write $\partial S$ for the set of vertices in $V \setminus S$ that have a neighbour in $S$.
Write $s := \max_{S \in \mathcal{S}} |S|$ for the size of the largest block in $\mathcal{S}$. Let $S \in \mathcal{S}$ be a random
block chosen according to the distribution $\psi$. Given $v \in V$, define

\[ \psi(v) = \mathbb{P}(v \in S), \qquad \psi_\partial(v) = \mathbb{P}(v \in \partial S). \]

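Our first parameter $\partial^+$ is

\[ \partial^+ = \partial^+(\mathcal{S}) = \max_{S \in \mathcal{S}} |\partial S|^{\min\{|S|, |\partial S|\}}. \tag{1} \]

Let $\psi_{\min} := \min_{v \in V} \psi(v)$ and define our second parameter $\Psi$ by

\[ \Psi = \Psi(\mathcal{S}, \psi) = \max_{v \in V} \frac{\psi_\partial(v)}{\psi(v)}. \tag{2} \]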
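
For a concrete block system the quantities $s$, $\partial^+$, $\psi_{\min}$ and $\Psi$ are straightforward to compute. The sketch below assumes an adjacency-list graph on vertices $0, \ldots, n-1$, blocks given as tuples, and `psi` a list of block probabilities aligned with `blocks`; the helper names are illustrative only.

```python
def boundary(adj, S):
    """Vertices outside S that have at least one neighbour in S."""
    in_S = set(S)
    return {w for u in S for w in adj[u]} - in_S


def block_parameters(adj, blocks, psi):
    """Return (s, d_plus, psi_min, Psi) for a block system with distribution psi."""
    n = len(adj)
    boundaries = [boundary(adj, S) for S in blocks]
    s = max(len(S) for S in blocks)
    # d_plus = max over blocks of |dS| ** min(|S|, |dS|).
    d_plus = max(len(dS) ** min(len(S), len(dS))
                 for S, dS in zip(blocks, boundaries))
    # psi_v[v] = P(v in S) and psi_bd[v] = P(v in boundary of S).
    psi_v = [sum(p for S, p in zip(blocks, psi) if v in S) for v in range(n)]
    psi_bd = [sum(p for dS, p in zip(boundaries, psi) if v in dS)
              for v in range(n)]
    Psi = max(psi_bd[v] / psi_v[v] for v in range(n))
    return s, d_plus, min(psi_v), Psi
```

On the 4-cycle with the four edge-blocks chosen uniformly, each vertex lies in two blocks and on the boundary of the other two, so $s = 2$, $\partial^+ = 2^2 = 4$, $\psi_{\min} = 1/2$ and $\Psi = 1$.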

Given $A \subseteq V$ and $X \in \Omega$, write $X|_A$ for the colouring $X$ restricted to $A$. Consider a
colouring $c \in [q]^S$. A colour used by $c$ is called free with respect to $X, S$ if it appears in
$X|_S$ but not in $X|_{\partial S}$. Write $f(X, S, c)$ for the number of free colours in $c$ with respect
to $X, S$. Our third parameter is defined by

\[ \mu^+ = \mu^+(\mathcal{S}) = \max_{S \in \mathcal{S}} \max_{X \in \Omega} \max_{f = 0, \ldots, |S| - 1} \max\Big\{ \frac{\mu_{X,S}(c)}{|S| - f} : c \in [q]^S, \ f(X, S, c) = f \Big\}. \tag{3} \]

The parameter $\mu^+$ is a scaled maximum value of $\mu_{X,S}(c)$, ruling out colourings in which
every element of $S$ receives a distinct free colour (such colourings will be treated separately
in the proof below). Although the definition of $\mu^+$ gives a dependency on $q$, in
our applications on bounded-degree graphs we can bound $\mu^+$ independently of $q$ (see
Proposition 2.4). Hence we suppress this dependence in our notation.

The following theorem gives a sufficient condition on the number of colours for $(\mathcal{S}, \psi)$-block
dynamics to be rapidly mixing. We restrict our attention to connected graphs,
and to avoid trivialities we do not allow the vertex set $V$ to be a block.

Theorem 2.3. Let $G = (V, E)$ be a connected graph and let $\mathcal{S}$ be a block system for $G$
such that $V \notin \mathcal{S}$. Let $\psi$ be a distribution on $\mathcal{S}$ and fix $\lambda > 1$. If

\[ q \ge (2s)^{s+1} \, \partial^+ \, \Psi \, \lambda^{\mu^+} \]

(where parameters $s$, $\partial^+$, $\Psi$ and $\mu^+$ are as defined above) then the $(\mathcal{S}, \psi)$-block dynamics
$\mathcal{M}_{BD} = \mathcal{M}^{\mathcal{S},\psi}_{BD}(G, \lambda, q)$ satisfies

\[ \tau(\mathcal{M}_{BD}, \varepsilon) \le 2 \psi_{\min}^{-1} \log(n \varepsilon^{-1}). \]

We remark that for the bound $q \ge (2s)^{s+1} \partial^+ \Psi \lambda^{\mu^+}$ in Theorem 2.3 we expect that
the constant multiplicative factor $(2s)^{s+1} \partial^+ \Psi$ can be improved; however we have not
attempted to do this in order to keep our treatment simple.

Proof. We define a coupling $(A, B) \mapsto (A', B')$ for $\mathcal{M}_{BD}$ on $\Lambda$ as follows. Given $(A, B) \in
\Lambda$, let $u = u(A, B)$ be the (unique) vertex which is coloured differently by $A$ and $B$.
We choose $S \in \mathcal{S}$ at random using the distribution $\psi$ and obtain $A'$ (respectively,
$B'$) by updating the colouring of $S$ in $A$ (respectively, $B$) according to the distribution
$\phi_A = \phi_{A,S}$ (respectively, $\phi_B = \phi_{B,S}$); this will give a coupling since $A$ and $B$ are updated
using the transition procedure of $\mathcal{M}_{BD}$. We choose the joint distribution on $(\phi_A, \phi_B)$
so as to maximise the probability that $A'|_S = B'|_S$. Call this maximised probability
$p(S, A, B)$. Observe that $p(S, A, B) = 1$ if $u \notin \partial S$ (because $A$ and $B$ assign the same
colours to $\partial S$, so $\phi_A$ and $\phi_B$ are the same distribution). For the case that $u \in \partial S$, we
uniformly bound $p(S, A, B)$ by setting

\[ p := \min_{(A, B) \in \Lambda} \ \min_{S \in \mathcal{S} : u \in \partial S} p(S, A, B). \]

(Let $p = 1$ if $u \notin \partial S$ for all $S \in \mathcal{S}$.) Now for all $S \in \mathcal{S}$ with $u \in \partial S$ we have

\[ p(S, A, B) \ge \frac{q^{|S|}}{\max(Z_{A,S}, Z_{B,S})}. \tag{4} \]

We claim that

\[ \frac{q^{|S|}}{Z_{X,S}} \ge 1 - \frac{1}{2 s \Psi} \tag{5} \]

for all $X \in \Omega$ and $S \in \mathcal{S}$. If (5) holds then substituting into (4) gives

\[ p \ge 1 - \frac{1}{2 s \Psi}, \]

which in turn implies that

\[ \begin{aligned}
\mathbb{E}(d(A', B') - 1 \mid (A, B)) &\le -\psi(u) + s \, \psi_\partial(u) (1 - p) \\
&= -\psi(u) \Big( 1 - \frac{s \, \psi_\partial(u)}{\psi(u)} (1 - p) \Big) \\
&\le -\psi_{\min} \big( 1 - s \Psi (1 - p) \big) \\
&\le -\frac{\psi_{\min}}{2}.
\end{aligned} \]

The theorem follows from this, by Lemma 2.1. So it remains to establish (5).

Fix $X \in \Omega$ and $S \in \mathcal{S}$. For any colouring $c$, write $Q(c)$ for the set of colours used
by $c$. Given a colouring $c \in [q]^S$, the colour classes of $c$ define a partition $P$ of $S$ into
(unordered) nonempty parts. (Here, we think of a partition $P$ of $S$ as a set of nonempty
parts $\{P_1, \ldots, P_t\}$ where $P_i \subseteq S$ are disjoint and $\bigcup_{A \in P} A = S$.) Let $F \subseteq P$ be the set of
colour classes corresponding to colours which are free with respect to $X, S$ (in the given
colouring $c$).

Conversely, we can start from a partition $P$ of $S$ and a subset $F$ of $P$. Given a set of
$|P|$ colours, we can form a colouring of $S$ by assigning a distinct colour to each part of
$P$ such that the colour assigned to $A \in P$ belongs to $[q] \setminus Q(X|_{\partial S})$ if and only if $A \in F$.
Any colouring which can be formed in this way is called a $(P, F)$-colouring of $S$. (Such
a colouring is uniquely determined by $(P, F)$ and the map $P \to [q]$ which performs the
assignment of colours.)

Let $n(S, P, F)$ be the number of $(P, F)$-colourings of $S$. By definition of $\mu^+$ we have

\[ Z_{X,S} = \sum_{c \in [q]^S} \lambda^{\mu_{X,S}(c)} \le q^{|S|} + \sum_{(P, F) : |F| \neq |S|} n(S, P, F) \, \lambda^{\mu^+ (|S| - |F|)}. \]

The first term corresponds to $P = F$ with $|P| = |S|$, arising from a colouring $c \in [q]^S$ in
which every vertex in $S$ receives a distinct free colour. We use $q^{|S|}$ as an upper bound
for the number of such colourings. For all other values of $(S, P, F)$ we have the following
crude bound:

\[ n(S, P, F) \le q_1^{\min\{q_1, |P| - |F|\}} (q - q_1)^{|F|} \le |\partial S|^{\min\{|\partial S|, |S|\}} q^{|F|} \le \partial^+ q^{|F|}, \]

where $q_1 = |Q(X|_{\partial S})|$ and we recall that all parts must be coloured differently.
Substituting gives

\[ Z_{X,S} \le q^{|S|} + \sum_{(P, F) : |F| \neq |S|} \partial^+ q^{|F|} \lambda^{\mu^+ (|S| - |F|)}. \]

Now applying the bound on $q$ from the theorem statement gives

\[ \begin{aligned}
\frac{Z_{X,S}}{q^{|S|}} &\le 1 + \sum_{(P, F) : |F| \neq |S|} \partial^+ q^{|F| - |S|} \lambda^{\mu^+ (|S| - |F|)} \\
&\le 1 + \sum_{(P, F) : |F| \neq |S|} \partial^+ \big( (2s)^{s+1} \partial^+ \Psi \lambda^{\mu^+} \big)^{|F| - |S|} \lambda^{\mu^+ (|S| - |F|)} \\
&\le 1 + \sum_{(P, F) : |F| \neq |S|} \big( (2s)^{s+1} \Psi \big)^{|F| - |S|}.
\end{aligned} \tag{6} \]

The number of terms in the above sum is at most $(2|S|)^{|S|}$, since there are at most $|S|^{|S|}$
choices of the partition $P$ and at most $2^{|P|} \le 2^{|S|}$ choices of $F$.
Next, note that

\[ \Psi = \max_{v \in V} \frac{\psi_\partial(v)}{\psi(v)} \ge \sum_{v \in V} p(v) \, \frac{\psi_\partial(v)}{\psi(v)} \]

for any probability distribution $p$ on $V$. In particular, we can take $p(v) = \psi(v)/N$,
where

\[ N = \sum_{v \in V} \psi(v) = \sum_{S \in \mathcal{S}} \psi(S) |S| \le s. \]

With this choice of $p$, we obtain the bound

\[ \Psi \ge N^{-1} \sum_{v \in V} \psi_\partial(v) = N^{-1} \sum_{S \in \mathcal{S}} \psi(S) |\partial S| \ge \frac{1}{s}, \]

since $\partial S$ is nonempty for all $S \in \mathcal{S}$, as $G$ is connected and $V \notin \mathcal{S}$. It follows that
$(2s)^{s+1} \Psi \ge 1$, and combining this with (6) gives

\[ \frac{Z_{X,S}}{q^{|S|}} \le 1 + \frac{(2s)^s}{(2s)^{s+1} \Psi} = 1 + \frac{1}{2 s \Psi}. \]

Inverting this and using the identity $(1 + y)^{-1} \ge 1 - y$ establishes (5), completing the
proof. □



2.4 Block dynamics for specific examples 

In this subsection we illustrate how one can use Theorem 2.3 to obtain rapid mixing
results for block dynamics on graphs of bounded degree. In the next subsection, we shall
see how these results for block dynamics can be translated into rapid mixing results for
Glauber dynamics.

In order to build some intuition, we begin by investigating the range of possible
values of the parameter $\mu^+$. We will need the following notation: given $T \subseteq T' \subseteq V$, we
write $\mathrm{vol}(T, T')$ for the set of edges of $G$ that are contained in $T'$ and have at least one
endvertex in $T$.

Proposition 2.4. Let $G = (V, E)$ be a graph of maximum degree $\Delta$ and let $\mathcal{S}$ be any
block system for $G$. Then

\[ \mu^+ = \mu^+(\mathcal{S}) \le \Delta. \]

If in addition $G$ is regular then

\[ \frac{\Delta}{2} \le \mu^+(\mathcal{S}) \le \Delta. \]

Proof. First fix $X \in \Omega$ and $S \in \mathcal{S}$. Given a colouring $c \in [q]^S$, let $P$ be the partition
of $S$ defined by the nonempty colour classes of $c$. Define $F \subseteq P$ to be the set of colour
classes of $c$ which correspond to a colour which does not appear on $X|_{\partial S}$. Let

\[ A_F = \bigcup_{A \in F} A \]

and

\[ A'_F = \bigcup_{A \in F : |A| \ge 2} A. \]

Since $G$ has maximum degree $\Delta$, a trivial upper bound on $\mu_{X,S}(c)$ is $\Delta |S|$. But note
that if a monochromatic edge $e$ is incident to a vertex in $A_F$, then $e$ must have both
endpoints in the same part $A$ of $P$. Thus edges incident to vertices in $A_F \setminus A'_F$ do not
contribute to $\mu_{X,S}(c)$ and monochromatic edges incident to vertices in $A'_F$ are double
counted in the trivial bound. Hence

\[ \begin{aligned}
\mu_{X,S}(c) &\le \Delta \Big( |S| - |A_F \setminus A'_F| - \tfrac{1}{2} |A'_F| \Big) \\
&= \Delta \Big( |S| - \sum_{A \in F : |A| = 1} 1 - \sum_{A \in F : |A| \ge 2} \tfrac{1}{2} |A| \Big) \\
&\le \Delta (|S| - |F|).
\end{aligned} \]

Hence the upper bound holds, by definition of $\mu^+$.

Next, suppose that $G$ is $\Delta$-regular with $X \in \Omega$ and $S \in \mathcal{S}$. Consider any colouring
$c \in [q]^S$ which assigns a single colour to all of $S$, and where this is the only colour used
in $X|_{\partial S}$. Then

\[ \mu^+ \ge \frac{\mu_{X,S}(c)}{|S| - |F|} = \frac{|\mathrm{vol}(S, S \cup \partial S)|}{|S|} \ge \frac{\Delta}{2}, \]

where the last inequality follows because $G$ is regular of degree $\Delta$.

□

Next we show how to improve the upper bound on $\mu^+$ given in Proposition 2.4 by
choosing our block system more carefully.

Let $k \ge 2$ be an integer and let $G = (V, E)$ be a graph with $n$ vertices and with
maximum degree $\Delta$. Let

\[ \mathcal{S} = \{ S_v : v \in V \} \]

where for all $v \in V$ the set $S_v \subseteq V$ satisfies $v \in S_v$, $|S_v| = k$ and $G[S_v]$ is connected.
Then $\mathcal{S}$ is called a $k$-block system for $G$. Let $\psi$ be the uniform distribution over $\mathcal{S}$. To
apply Theorem 2.3 to the $(\mathcal{S}, \psi)$-block dynamics we will calculate upper bounds on the
parameters $\partial^+$, $\Psi$ and $\mu^+$.

Clearly $|\partial S| \le \Delta k$ and $\min\{k, |\partial S|\} \le k$ for all $S \in \mathcal{S}$. Hence

\[ \theta^+ \le (\Delta k)^k. \tag{7} \]

To compute $\Psi$ observe first that $\psi(v) \ge 1/n$ for all $v \in V$, as there are $n$ blocks and each vertex belongs to at least one block. Next, observe that $\psi_\partial(v) \le \Delta^k/n$. Indeed, if $v \in \partial S_u$ for some $u \in V$ then $u$ is at distance at most $k$ from $v$. Since there are at most $\Delta^k$ vertices (excluding $v$) at distance at most $k$ from $v$ in $G$, there are at most $\Delta^k$ out of $n$ blocks containing $v$ in their boundary. Therefore

\[ \Psi = \max_{v \in V} \frac{\psi_\partial(v)}{\psi(v)} \le \Delta^k. \tag{8} \]

In order to calculate an upper bound on $\mu^+$ we first prove a preliminary result. For $T \subseteq T' \subseteq V$, recall the notation $\mathrm{vol}(T, T')$ introduced above Proposition 2.4 and note that $\mathrm{vol}(T, T)$ is just the set of edges inside $T$. If $T \subseteq T'$ then we write $E(T, T')$ for the set of edges that are contained in $T'$ and have exactly one endvertex in $T$.

For any two sets $A$, $B$, we write $\delta_{A,B}$ for the indicator function that $A = B$, that is, $\delta_{A,B} = 1$ if $A = B$ and $\delta_{A,B} = 0$ otherwise.

Proposition 2.5. Let $H = (V, E)$ be a connected graph and let $U \subseteq V$. Then

\[ |\mathrm{vol}(U, V)| \ge |U| - \delta_{U,V}. \]

Proof. It is sufficient to prove the statement for $H$ a tree. The statement is clear if $U = V$. Now suppose that $U \ne V$ and consider the components $C_1, \ldots, C_r$ of $H[U]$. Then $\mathrm{vol}(C_i, V)$ has at least $|C_i|$ edges and is disjoint from $\mathrm{vol}(C_j, V)$ for all $j \ne i$. Thus

\[ |\mathrm{vol}(U, V)| = \sum_{i=1}^{r} |\mathrm{vol}(C_i, V)| \ge \sum_{i=1}^{r} |C_i| = |U|. \]

□ 
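As a sanity check on Proposition 2.5, the following minimal sketch tests the inequality $|\mathrm{vol}(U, V)| \ge |U| - \delta_{U,V}$ over every subset $U$ of a small graph by brute force. The function names and the example graphs are illustrative choices, not part of the paper.

```python
from itertools import combinations

def vol(edges, T, T_prime):
    """Edges contained in T_prime that have at least one endvertex in T."""
    T, T_prime = set(T), set(T_prime)
    return [(u, v) for (u, v) in edges
            if u in T_prime and v in T_prime and (u in T or v in T)]

def check_proposition_2_5(vertices, edges):
    """Return True if |vol(U, V)| >= |U| - [U == V] for every subset U of V."""
    V = set(vertices)
    for size in range(len(vertices) + 1):
        for U in combinations(vertices, size):
            delta = 1 if set(U) == V else 0
            if len(vol(edges, U, V)) < len(U) - delta:
                return False
    return True

# Hypothetical example: a path on 5 vertices (a tree, where the bound is tight).
path_vertices = list(range(5))
path_edges = [(i, i + 1) for i in range(4)]
print(check_proposition_2_5(path_vertices, path_edges))
```

Connectivity matters here: on a graph with an isolated vertex the same check fails, since that vertex alone has an empty volume.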

Next we give an upper bound on the parameter $\mu^+$ for $k$-block systems. For $k \ge 2$ this bound is a slight improvement on the upper bound given in Proposition 2.4.

Lemma 2.6. Let $G = (V, E)$ be a connected graph with $n$ vertices and maximum degree $\Delta$. Fix an integer $k \in \{2, \ldots, n-1\}$ and let $\mathcal{S}$ be any $k$-block system for $G$. Then

\[ \mu^+ = \mu^+(\mathcal{S}) \le \Delta - 1 + \frac{1}{k}. \]

Proof. Fix $X \in \Omega = [q]^V$ and $v \in V$. Given a colouring $c \in [q]^{S_v}$, let $P$ be the partition of $S_v$ defined by the nonempty colour classes of $c$. Define $F \subseteq P$ to be the set of colour classes of $c$ which correspond to a colour which does not appear on $X|_{\partial S_v}$. Let

\[ A_F = \bigcup_{A \in F} A, \qquad A_{\bar F} = \bigcup_{A \notin F} A \]

and define $a_F = |A_F|$ and $a_{\bar F} = |A_{\bar F}|$. Writing $\mu_{X,v} = \mu_{X,S_v}$ for ease of notation, we have

\[ \mu_{X,v}(c) \le \Bigl(\sum_{A \in F} |\mathrm{vol}(A, A)|\Bigr) + \Bigl(\sum_{A \notin F} |\mathrm{vol}(A, A \cup \partial S_v)|\Bigr) \le \Bigl(\sum_{A \in F} |\mathrm{vol}(A, A)|\Bigr) + |\mathrm{vol}(A_{\bar F}, A_{\bar F} \cup \partial S_v)|. \tag{9} \]

Observe that

\[ |\mathrm{vol}(A_{\bar F}, A_{\bar F} \cup \partial S_v)| \le \Delta a_{\bar F} - |\mathrm{vol}(A_{\bar F}, S_v)| \le (\Delta - 1) a_{\bar F} + \delta_{F,\emptyset}, \tag{10} \]

where the last inequality follows by Proposition 2.5 and noting that $\delta_{A_{\bar F}, S_v} = \delta_{F,\emptyset}$. Next we claim that for $A \in P$ we have

\[ |\mathrm{vol}(A, A)| \le (|A| - 1)(\Delta - 1). \tag{11} \]

To ease notation, write $a = |A|$. If $a = 1, 2$ then (11) clearly holds (noting that $\Delta \ge 2$ since $G$ is connected). Next, (11) holds for $\Delta = 2$ since we have $|\mathrm{vol}(A, A)| \le a - 1$, where the "$-1$" appears because there is at least one edge leaving $A$ (since $G$ is connected). If $a = 3$ and $\Delta \ge 3$ then $|\mathrm{vol}(A, A)| \le 3$ and $(a - 1)(\Delta - 1) \ge 4$, so (11) holds. For $a \ge 4$ and $\Delta \ge 3$, we note that $|\mathrm{vol}(A, A)| \le \Delta a/2$ and check that $\Delta a/2 \le (a - 1)(\Delta - 1)$ holds in this case. This proves the claim, establishing (11).

Therefore

\[ \sum_{A \in F} |\mathrm{vol}(A, A)| \le \sum_{A \in F} (|A| - 1)(\Delta - 1) = (a_F - |F|)(\Delta - 1). \tag{12} \]

Combining (9), (10), and (12), we have

\[ \mu_{X,v}(c) \le (a_F - |F|)(\Delta - 1) + (\Delta - 1) a_{\bar F} + \delta_{F,\emptyset} = (\Delta - 1)(k - |F|) + \delta_{F,\emptyset}. \]

Assuming that $|F| \ne k$, dividing by $k - |F|$ bounds the ratio by $\Delta - 1$ if $F \ne \emptyset$ and by $\Delta - 1 + k^{-1}$ if $F = \emptyset$. This completes the proof. □

Substituting (7), (8) and the result of Lemma 2.6 into Theorem 2.3 gives the following, noting that $\psi_{\min} \ge \frac{1}{n}$.

Theorem 2.7. Let $G = (V, E)$ be a connected graph with $n$ vertices and maximum degree $\Delta$. Fix an integer $k \in \{2, \ldots, n-1\}$ and let $\mathcal{S}$ be a $k$-block system for $G$. Let $\psi$ be the uniform distribution on $\mathcal{S}$. Fix $\lambda > 1$. If

\[ q > 2^{k+1} \Delta^{2k} k^{2k+1} \lambda^{\Delta - 1 + 1/k} \]

then $\tau(\mathcal{M}_{\mathrm{BD}}, \varepsilon) \le 2n \log(n\varepsilon^{-1})$.

To further illustrate the use of Theorem 2.3 we apply it to the grid. Although our results are not as sharp as those discussed in [27], using the structure of the grid we are able to prove an upper bound on $\mu^+$ which is close to the lower bound given in Proposition 2.4. (See Lemma 2.9 below.)

For convenience, rather than considering the $n \times n$ grid, we consider the toroidal $n$-grid $G = (V, E)$, where $V = (\mathbb{Z}/n\mathbb{Z})^2$, and $(a, b)(c, d) \in E$ if and only if, in $\mathbb{Z}/n\mathbb{Z}$,

either ($a - c = \pm 1$ and $b - d = 0$) or ($b - d = \pm 1$ and $a - c = 0$).

Note that the toroidal $n$-grid has $n^2$ vertices. The arguments below can be adapted to higher dimensions and to graphs with different grid topologies provided that the graph is locally a grid.

Let $\mathcal{S}$ be the set of all $r \times r$ subgrids of $G$, where $r \le n - 2$. Then $\mathcal{S}$ is an $r^2$-block system: given $S \in \mathcal{S}$, write $S = S_v$ where $v$ is the vertex at the "top left" corner of $S$. Let $\psi$ be the uniform distribution on $\mathcal{S}$. To apply Theorem 2.3 we must calculate upper bounds on the parameters.

Firstly, note that

\[ \theta^+ = (4r)^{4r} \tag{13} \]

since $|\partial S| = 4r$ for all $S \in \mathcal{S}$. Next, for $v \in V$ we have $\psi(v) = r^2/n^2$ and $\psi_\partial(v) = 4r/n^2$, and so

\[ \Psi = \frac{4}{r}. \tag{14} \]
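The counts used above for the toroidal grid can be checked by direct enumeration. The sketch below builds the $r \times r$ blocks of the toroidal $n$-grid and counts how many blocks contain a given vertex and how many have it on their boundary; the helper names are illustrative, and the check assumes $3 \le r + 2 \le n$ so that each block has $4r$ distinct boundary vertices.

```python
def toroidal_neighbours(v, n):
    """The four neighbours of v in the toroidal n-grid on (Z/nZ)^2."""
    a, b = v
    return {((a + 1) % n, b), ((a - 1) % n, b),
            (a, (b + 1) % n), (a, (b - 1) % n)}

def subgrid(corner, r, n):
    """Vertex set of the r x r subgrid whose 'top left' corner is `corner`."""
    a, b = corner
    return {((a + i) % n, (b + j) % n) for i in range(r) for j in range(r)}

def boundary(S, n):
    """Vertices outside S that have a neighbour in S."""
    return {w for v in S for w in toroidal_neighbours(v, n)} - S

def block_statistics(n, r, v=(0, 0)):
    """Return (#blocks containing v, #blocks with v on their boundary)."""
    blocks = [subgrid((a, b), r, n) for a in range(n) for b in range(n)]
    inside = sum(1 for S in blocks if v in S)
    on_boundary = sum(1 for S in blocks if v in boundary(S, n))
    return inside, on_boundary

# With n = 6 and r = 3 there are n^2 = 36 blocks; dividing the two counts
# by 36 gives psi(v) = r^2/n^2 and psi_boundary(v) = 4r/n^2.
print(block_statistics(6, 3))
```

Dividing the second count by the first recovers the ratio $4/r$ in (14).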

In order to obtain a tighter bound on $\mu^+$ we need more information about expansion properties of the grid.

Lemma 2.8. Let $G = (V, E)$ be an $n \times n$ grid and let $S$ be the vertices of an $r \times r$ subgrid. If $T \subseteq S$ and $|T| = t'$ then $|\mathrm{vol}(T, T)| \le 2t' - 2\sqrt{t'}$ and $|\mathrm{vol}(T, S \cup \partial S)| \ge 2t' + 2\sqrt{t'}$.

Proof. For $T \subseteq S$, we define $\bar T = (S \cup \partial S) \setminus T$. First, we claim that

\[ \text{if } |E(T, \bar T)| \le 4t \text{ then } |T| \le t^2. \tag{15} \]

To prove the claim, let us choose $T$ such that $|T|$ is maximised subject to $|E(T, \bar T)| \le 4t$. We may assume that $G[T]$ is connected or else we can translate components to connect $G[T]$ without increasing $|E(T, \bar T)|$. Furthermore, we may assume that $T$ is convex (that is, $T$ is a rectangular subgrid) because if $T$ has any "missing corners" (that is, a vertex outside $T$ with at least two neighbours in $T$) then we can add the missing vertex without increasing $|E(T, \bar T)|$. It is also easy to verify that amongst the rectangles with $|E(T, \bar T)| = 4t$, the square (with $t^2$ vertices) has the largest area. This completes the proof of the claim.

Now suppose that $|T| = t'$. Using the contrapositive of (15), we have

\[ 2|\mathrm{vol}(T, T)| = 4|T| - |E(T, \bar T)| \le 4t' - 4\sqrt{t'}, \]

and dividing by two establishes the first statement. The second statement follows since

\[ |\mathrm{vol}(T, S \cup \partial S)| = 4|T| - |\mathrm{vol}(T, T)|. \]

□ 
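The first inequality of Lemma 2.8 can be checked exhaustively on a small patch of the grid. The sketch below enumerates every vertex subset of a planar $4 \times 4$ patch (standing in for a region that is "locally a grid") and records the largest number of internal edges for each subset size; the function name and the patch size are illustrative assumptions.

```python
import math

def max_internal_edges_by_size(rows, cols):
    """best[t] = max number of edges inside a t-vertex subset of the grid patch."""
    cells = [(i, j) for i in range(rows) for j in range(cols)]
    index = {cell: k for k, cell in enumerate(cells)}
    edge_masks = []
    for (i, j) in cells:
        for nb in ((i + 1, j), (i, j + 1)):
            if nb in index:
                edge_masks.append((1 << index[(i, j)]) | (1 << index[nb]))
    best = [0] * (len(cells) + 1)
    for mask in range(1 << len(cells)):
        size = bin(mask).count("1")
        inside = sum(1 for e in edge_masks if mask & e == e)
        best[size] = max(best[size], inside)
    return best

best = max_internal_edges_by_size(4, 4)
# Compare with the bound 2t' - 2*sqrt(t') for every nonempty subset size.
ok = all(best[t] <= 2 * t - 2 * math.sqrt(t) + 1e-9 for t in range(1, 17))
print(ok)
```

The bound is attained by square subsets, for example the full $4 \times 4$ patch with its 24 internal edges.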

For the toroidal grid, we may now give an upper bound for the parameter $\mu^+$ which is close to the lower bound proved in Proposition 2.4.

Lemma 2.9. Let $G$ be the toroidal grid with $n^2$ vertices, and let $\mathcal{S}$ be the $r^2$-block system consisting of all $r \times r$ subgrids of $G$. Then

\[ \mu^+ = \mu^+(\mathcal{S}) \le 2 + \frac{2}{r}. \]

Proof. Suppose that $X \in \Omega$ and $v \in V$. For a given $c \in [q]^{S_v}$, let $P$ be the corresponding partition of $S_v$ given by the colour classes of $c$. As usual, let $F \subseteq P$ be the set of colour classes corresponding to colours which do not appear on $X|_{\partial S_v}$.

Recall the notation $A_F$, $A_{\bar F}$, $a_F$ and $a_{\bar F}$ introduced in Lemma 2.6. As in (9) we write $\mu_{X,v}$ for $\mu_{X,S_v}$, and find that

\[ \mu_{X,v}(c) \le \Bigl(\sum_{A \in F} |\mathrm{vol}(A, A)|\Bigr) + |\mathrm{vol}(A_{\bar F}, A_{\bar F} \cup \partial S_v)|. \]

Using Lemma 2.8 we have

\[ \sum_{A \in F} |\mathrm{vol}(A, A)| \le \sum_{A \in F} 2\bigl(|A| - \sqrt{|A|}\bigr) = 2a_F - 2\sum_{A \in F} \sqrt{|A|} \le 2a_F - 2|F|. \]

In order to bound $|\mathrm{vol}(A_{\bar F}, A_{\bar F} \cup \partial S_v)|$, observe first that $\mathrm{vol}(S_v, S_v \cup \partial S_v)$ is the disjoint union of $\mathrm{vol}(A_{\bar F}, A_{\bar F} \cup \partial S_v)$ and $\mathrm{vol}(A_F, S_v \cup \partial S_v)$. Thus

\begin{align*}
|\mathrm{vol}(A_{\bar F}, A_{\bar F} \cup \partial S_v)| &= |\mathrm{vol}(S_v, S_v \cup \partial S_v)| - |\mathrm{vol}(A_F, S_v \cup \partial S_v)| \\
&= 2r^2 + 2r - |\mathrm{vol}(A_F, S_v \cup \partial S_v)| \\
&\le 2r^2 + 2r - 2a_F - 2\sqrt{a_F} \qquad \text{by Lemma 2.8} \\
&\le 2r^2 + 2r - 2a_F - 2\sqrt{|F|}.
\end{align*}

Combining the three inequalities above, we have

\[ \mu_{X,v}(c) \le 2(r^2 - |F|) + 2\bigl(r - \sqrt{|F|}\bigr) = (r^2 - |F|)\Bigl(2 + \frac{2}{r + \sqrt{|F|}}\Bigr) \le (r^2 - |F|)\Bigl(2 + \frac{2}{r}\Bigr). \]

For all $F$ with $|F| \ne r^2$, dividing by $r^2 - |F|$ gives the value $2 + \frac{2}{r}$, completing the proof. □

Substituting (13), (14) and the result of Lemma 2.9 into Theorem 2.3 gives the following, noting that $\psi_{\min} = r^2/n^2$.

Theorem 2.10. Let $G = (V, E)$ be the toroidal $n$-grid, let $\mathcal{S}$ be the $r^2$-block system consisting of the set of $r \times r$ subgrids of $G$, for some $r \le n - 2$. Given $\lambda > 1$, if

\[ q > 2^{r^2 + 8r + 3}\, r^{2r^2 + 4r + 1}\, \lambda^{2 + 2/r} \]

then for $\mathcal{M}_{\mathrm{BD}} = \mathcal{M}^{\mathcal{S},\psi}_{\mathrm{BD}}(G, \lambda, q)$, we have $\tau(\mathcal{M}_{\mathrm{BD}}, \varepsilon) \le 2n^2 \log(n^2 \varepsilon^{-1})/r^2$.

2.5 Glauber dynamics via Markov chain comparison

We now describe the machinery needed to compare the mixing times of the Glauber dynamics and the block dynamics considered in the previous two subsections. We closely follow [7].

Suppose that $\mathcal{M}$ is a reversible, ergodic Markov chain on state space $\Omega$ with transition matrix $P$ and stationary distribution $\pi$. Let $\mathcal{M}'$ be another reversible, ergodic Markov chain on $\Omega$ with transition matrix $P'$ and the same stationary distribution.

We say a transition $(x, y)$ of $\mathcal{M}$ (respectively, $\mathcal{M}'$) is positive if $P(x, y) > 0$ (respectively, $P'(x, y) > 0$); here we allow the possibility that $x = y$. For every positive transition $(x, y)$ of $\mathcal{M}'$, let $\mathcal{P}_{x,y}$ be the set of paths $\gamma = (x = x_0, \ldots, x_k = y)$ such that all the $x_i$ are distinct and each $(x_i, x_{i+1})$ is a positive transition of $\mathcal{M}$. Let $\mathcal{P} = \bigcup \mathcal{P}_{x,y}$, where the union is taken over all positive transitions $(x, y)$ of $\mathcal{M}'$ with $x \ne y$.

We write $|\gamma|$ to denote the length of the path $\gamma$ so that, for example, $|\gamma| = k$ for $\gamma = (x_0, \ldots, x_k)$.

An $(\mathcal{M}, \mathcal{M}')$-flow is a function $f$ from $\mathcal{P}$ to the interval $[0, 1]$ such that for every positive transition $(x, y)$ of $\mathcal{M}'$ with $x \ne y$, we have

\[ \sum_{\gamma \in \mathcal{P}_{x,y}} f(\gamma) = \pi(x) P'(x, y). \]

For a positive transition $(z, w)$ of $\mathcal{M}$, the congestion of $(z, w)$ is defined to be

\[ A_{z,w}(f) = \frac{1}{\pi(z) P(z, w)} \sum_{\gamma \in \mathcal{P}:\, (z,w) \in \gamma} |\gamma| f(\gamma). \]

The congestion of the flow is defined to be $A(f) = \max A_{z,w}(f)$, where the maximum is taken over all positive transitions $(z, w)$ of $\mathcal{M}$ with $z \ne w$. We now state a result that relates the mixing times of $\mathcal{M}$ and $\mathcal{M}'$ in terms of $A(f)$.

We will use the following theorem, which is slightly less general than [7, Theorem 10].

Theorem 2.11. [7, Theorem 10] Suppose that $\mathcal{M}$ is a reversible ergodic Markov chain with transition matrix $P$ and stationary distribution $\pi$ and that $\mathcal{M}'$ is another reversible ergodic Markov chain with the same stationary distribution. Suppose that $f$ is an $(\mathcal{M}, \mathcal{M}')$-flow. If $\mathcal{M}$ has no negative eigenvalues then for any $0 < \delta < \frac{1}{2}$, we have

\[ \tau_x(\mathcal{M}, \varepsilon) \le A(f) \left[ \frac{\tau(\mathcal{M}', \delta)}{\log(1/2\delta)} + 1 \right] \log \frac{1}{\varepsilon \pi(x)}. \]

Now we apply the above theorem to compare the mixing time of the Glauber dynamics and the block dynamics. Write $\tau(\mathcal{M}') = \tau(\mathcal{M}', \frac{1}{2e})$.

Lemma 2.12. Let $G = (V, E)$ be an $n$-vertex graph of maximum degree $\Delta$. Given $\lambda > 1$, a positive integer $q$, a block system $\mathcal{S}$ for $G$ with maximum block size $s$, and $\psi$ a probability distribution on $\mathcal{S}$, write $\mathcal{M} = \mathcal{M}_{\mathrm{GD}}(G, \lambda, q)$ and $\mathcal{M}' = \mathcal{M}^{\mathcal{S},\psi}_{\mathrm{BD}}(G, \lambda, q)$. Then for all $\varepsilon > 0$ we have

\[ \tau(\mathcal{M}, \varepsilon) \le 2s\, q^{s+1} \lambda^{\Delta(s+1)}\, \tau(\mathcal{M}')\, n \bigl(n \log(q\lambda^{\Delta/2}) + \log(\varepsilon^{-1})\bigr). \]

Proof. As before, let $P$ and $P'$ be the transition matrices of $\mathcal{M}$ and $\mathcal{M}'$ respectively. We note at the outset that both $\mathcal{M}$ and $\mathcal{M}'$ have the Gibbs distribution $\pi$ as their stationary distribution. It is proved in [HI Section 1.1] that the Glauber dynamics $\mathcal{M}$ has no negative eigenvalues, so we may apply Theorem 2.11.

We construct an $(\mathcal{M}, \mathcal{M}')$-flow and analyse its congestion. Recall that a transition in $\mathcal{M}'$ is obtained by starting at some $X \in \Omega = [q]^V$, selecting $S \in \mathcal{S}$ at random using the distribution $\psi$ and then updating the colouring of $S$ to some colouring $c \in [q]^S$ chosen randomly using the distribution $\varphi = \varphi_{X,S}$. The resulting colouring is denoted by $X^{(S,c)}$. Let $h(X, S, c) := \psi(S)\varphi_{X,S}(c)$ be the probability that this pair $(S, c)$ is chosen. In particular, if $(X, Y)$ is a transition of $\mathcal{M}'$ then

\[ P'(X, Y) = \sum_{(S,c):\, Y = X^{(S,c)}} h(X, S, c). \]

Fix an ordering of the vertices of $G$. For each $X \in \Omega$, $S \in \mathcal{S}$, and a colouring $c \in [q]^S$ of $S$, we define the path $\gamma(X, S, c)$ from $X$ to $X^{(S,c)}$ as follows: starting from $X$, consider each vertex $v \in \{u \in S : X(u) \ne c(u)\}$, one at a time and in increasing vertex order, and change the colour of $v$ from $X(v)$ to $c(v)$. Thus $\gamma(X, S, c)$ is a path in $\Omega$ from $X$ to $X^{(S,c)}$ using positive transitions of $\mathcal{M}$.
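The path construction just described can be sketched as follows. This is only an illustration: colourings are represented as Python dictionaries from vertices to colours, and the function name and data layout are assumptions made for the example.

```python
def canonical_path(X, S, c, order):
    """Colourings visited when S is recoloured from X to c, one vertex at a time.

    X: dict vertex -> colour on all of V; S: set of vertices (the block);
    c: dict vertex -> colour on S; order: list of all vertices giving the
    fixed vertex ordering.
    """
    current = dict(X)
    path = [dict(current)]
    for v in order:
        # Only vertices of S whose colour must change produce a step.
        if v in S and current[v] != c[v]:
            current[v] = c[v]
            path.append(dict(current))
    return path

# Hypothetical example on four vertices with block S = {1, 2, 3}.
X = {0: 1, 1: 1, 2: 2, 3: 3}
S = {1, 2, 3}
c = {1: 2, 2: 2, 3: 1}
path = canonical_path(X, S, c, order=[0, 1, 2, 3])
print(len(path) - 1)  # number of single-vertex updates along the path
```

Consecutive colourings on the path differ in exactly one vertex, so every step is a move that the Glauber dynamics can make, and the path has length at most $|S|$.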
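We define an $(\mathcal{M}, \mathcal{M}')$-flow $f$ by setting $f(\gamma(X, S, c)) = \pi(X) h(X, S, c)$ for all $(X, S, c)$ and $f(\gamma) = 0$ for all other paths $\gamma$. To verify that this is indeed an $(\mathcal{M}, \mathcal{M}')$-flow, given a positive transition $(X, Y)$ of $\mathcal{M}'$ with $X \ne Y$, we have

\[ \sum_{\gamma \in \mathcal{P}_{X,Y}} f(\gamma) = \sum_{\gamma = \gamma(X,S,c):\, Y = X^{(S,c)}} f(\gamma) = \sum_{(S,c):\, Y = X^{(S,c)}} \pi(X) h(X, S, c) = \pi(X) P'(X, Y). \]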

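Next we bound the congestion of this flow. Let $(Z, W)$ be a positive transition of $\mathcal{M}$ with $Z \ne W$. Then the colourings $Z$ and $W$ differ on only one vertex, say $v$. The path $\gamma(X, S, c)$ uses the transition $(Z, W)$ only if $v \in S$ and the colourings $X$ and $Z$ differ on a subset of $S$. Thus we have

\begin{align*}
A_{Z,W}(f) &= \frac{1}{\pi(Z) P(Z, W)} \sum_{\gamma \ni (Z,W)} |\gamma| f(\gamma) \\
&\le \frac{1}{\pi(Z) P(Z, W)} \sum_{S \in \mathcal{S}:\, v \in S}\ \sum_{X:\, X|_{\bar S} = Z|_{\bar S}}\ \sum_{c \in [q]^S} |S| \cdot f(\gamma(X, S, c)) \\
&\le \frac{s}{\pi(Z) P(Z, W)} \sum_{S \in \mathcal{S}:\, v \in S}\ \sum_{X:\, X|_{\bar S} = Z|_{\bar S}}\ \sum_{c \in [q]^S} \pi(X) \psi(S) \varphi_{X,S}(c) \\
&\le \frac{s}{P(Z, W)} \sum_{S \in \mathcal{S}:\, v \in S}\ \sum_{X:\, X|_{\bar S} = Z|_{\bar S}} \psi(S) \frac{\pi(X)}{\pi(Z)}.
\end{align*}

If $X$ and $Z$ differ on at most $s$ vertices, and hence on at most $\Delta s$ edges, then

\[ \frac{\pi(X)}{\pi(Z)} \le \lambda^{\Delta s}. \]

Also, for any positive transition $(Z, W)$ of $\mathcal{M}$ we have

\[ P(Z, W)^{-1} \le q \lambda^{\Delta} n. \]

Substituting these upper bounds gives

\[ A_{Z,W}(f) \le s q \lambda^{\Delta} n \sum_{S:\, v \in S} \psi(S) \sum_{X:\, X|_{\bar S} = Z|_{\bar S}} \lambda^{\Delta s} \le s q \lambda^{\Delta} q^s \lambda^{\Delta s} \psi(v) n \le s q^{s+1} \lambda^{\Delta(s+1)} n, \]

since $\psi(v) \le 1$. We conclude that $A(f) \le s q^{s+1} \lambda^{\Delta(s+1)} n$.

Now apply Theorem 2.11 with $\delta = 1/(2e)$. Writing $m$ for the number of edges of $G$, for all $Z \in \Omega$ we have the crude bound

\[ \pi(Z) \ge (q^n \lambda^m)^{-1} \ge (q^n \lambda^{\Delta n/2})^{-1}, \]

which leads to

\begin{align*}
\tau(\mathcal{M}, \varepsilon) &\le s q^{s+1} \lambda^{\Delta(s+1)} n \,[\tau(\mathcal{M}') + 1]\, \log\bigl(q^n \lambda^{\Delta n/2} \varepsilon^{-1}\bigr) \\
&\le 2s\, q^{s+1} \lambda^{\Delta(s+1)}\, \tau(\mathcal{M}')\, n \bigl(n \log(q \lambda^{\Delta/2}) + \log(\varepsilon^{-1})\bigr),
\end{align*}

as claimed. □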

We would expect that the mixing time for Glauber dynamics should decrease as $q$ increases, but the bound given in Lemma 2.12 becomes worse for larger values of $q$. However, by combining Lemma 2.12 with Proposition 2.2 we can avoid this problem.

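Corollary 2.13. Let $G = (V, E)$ be an $n$-vertex graph of maximum degree $\Delta$. Given $\lambda > 1$, a positive integer $q$, a block system $\mathcal{S}$ for $G$ with maximum block size $s$, and $\psi$ a probability distribution on $\mathcal{S}$, write $\mathcal{M} = \mathcal{M}_{\mathrm{GD}}(G, \lambda, q)$ and $\mathcal{M}' = \mathcal{M}^{\mathcal{S},\psi}_{\mathrm{BD}}(G, \lambda, q)$. Then for $\varepsilon > 0$ we have

\[ \tau(\mathcal{M}, \varepsilon) \le \begin{cases} 2s (\Delta \lambda^{2\Delta})^{s+1}\, \tau(\mathcal{M}')\, n \bigl(n \log(\Delta \lambda^{3\Delta/2}) + \log(\varepsilon^{-1})\bigr) & \text{if } q \le \Delta \lambda^{\Delta} + 1, \\ (\Delta + 1)\, n \log(n \varepsilon^{-1}) & \text{if } q > \Delta \lambda^{\Delta} + 1. \end{cases} \]

Proof. If $q \le \Delta \lambda^{\Delta} + 1$ then the corollary holds by Lemma 2.12, while if $q > \Delta \lambda^{\Delta} + 1$ then the corollary holds by Proposition 2.2. □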

We complete this section by applying the previous corollary to the block dynamics 
results obtained in the previous subsection to obtain rapid mixing results for Glauber 
dynamics. 

Theorem 2.14. Let $G = (V, E)$ be an $n$-vertex connected graph with maximum degree $\Delta$, and fix $\lambda > 1$. For every positive integer $k < n$, if $q > 2^{k+1} \Delta^{2k} k^{2k+1} \lambda^{\Delta - 1 + 1/k}$ then for $\mathcal{M}_{\mathrm{GD}} = \mathcal{M}_{\mathrm{GD}}(G, \lambda, q)$, we have

\[ \tau(\mathcal{M}_{\mathrm{GD}}, \varepsilon) \le 4k\, (\Delta \lambda^{2\Delta})^{k+1}\, n^2 \log(2en) \bigl(n \log(\Delta \lambda^{3\Delta/2}) + \log(\varepsilon^{-1})\bigr). \]

Proof. Apply Corollary 2.13 to the mixing time of the block dynamics in Theorem 2.7. □

Theorem 2.15. Let $G = (V, E)$ be the toroidal $n$-grid, and fix $\lambda > 1$. For every positive integer $r < n$, if $q > 2^{r^2 + 8r + 3}\, r^{2r^2 + 4r + 1}\, \lambda^{2 + 2/r}$ then for $\mathcal{M}_{\mathrm{GD}} = \mathcal{M}_{\mathrm{GD}}(G, \lambda, q)$, we have

\[ \tau(\mathcal{M}_{\mathrm{GD}}, \varepsilon) \le 4\, (4\lambda^{8})^{r^2 + 1}\, n^4 \log(2en^2) \bigl(n^2 \log(4\lambda^{6}) + \log(\varepsilon^{-1})\bigr). \]

Proof. We apply Corollary 2.13 to the mixing time of the block dynamics in Theorem 2.10. □



3 An extremal problem 

In this section, we investigate how large the partition function of a bounded-degree 
graph can be. We require this result in the next section, where we give bounds on the 
number of colours below which Glauber dynamics mixes slowly, although the result may 
be of independent interest. 

In this section, we allow graphs to have multiple edges, but not loops. For fixed numbers $n$ the number of vertices, $m$ the number of edges, $\Delta$ the maximum degree, $\lambda > 1$ the fugacity, and $q$ the number of colours, we define

\[ Z((n, m, \Delta), \lambda, q) = \max_{G} Z(G, \lambda, q), \]

where the maximum is over all graphs $G$ with $n$ vertices, $m$ edges, and maximum degree $\Delta$.

We now describe the class of graphs that will turn out to be extremal for the above parameter. Fix positive integers $n$, $m$, and $\Delta$ such that $\Delta$ divides $m$ and $m \le \Delta n/2$. Let $H(n, m, \Delta) = (V, E)$, where $V$ is a set of $n$ vertices and $E$ is obtained by taking any set of $m/\Delta$ independent edges on $V$ and replacing each edge with $\Delta$ multi-edges. Thus $H(n, m, \Delta)$ has $m$ edges and maximum degree $\Delta$.

The main result of this section is the following. 

Theorem 3.1. If $G$ is an $n$-vertex graph with $m$ edges and maximum degree $\Delta$, and $q \in \mathbb{N}$ and $\lambda > 1$ are given, then

\[ Z(G, \lambda, q) \le \bigl(1 + q^{-1}(\lambda^{\Delta} - 1)\bigr)^{\lceil m/\Delta \rceil} q^n. \]

In particular, if $\Delta$ divides $m$, we have equality above for $G = H(n, m, \Delta)$.

This will immediately give us the following corollary.

Corollary 3.2. Let $n, m, \Delta \in \mathbb{N}$ be fixed. Given a number of colours $q$, and fugacity $\lambda > 1$, we have

\[ Z((n, m, \Delta), \lambda, q) \le \bigl(1 + q^{-1}(\lambda^{\Delta} - 1)\bigr)^{\lceil m/\Delta \rceil} q^n. \]

We begin by giving a brief outline of the proof. Given an $n$-vertex multigraph $G = (V, E)$, and a uniformly random (non-proper) $q$-colouring $\sigma$ of $V$, let $X$ be the number of monochromatic edges of $G$ in $\sigma$. Observe that $Z(G, \lambda, q) = \mathbb{E}(\lambda^X) q^n$. We proceed by decomposing the edges of $G$ into $\Delta$ forests with $\lceil m/\Delta \rceil$ or $\lfloor m/\Delta \rfloor$ edges each. Then we establish that the number of monochromatic edges in a forest with $m'$ edges is distributed as $X \sim \mathrm{Bin}(m', q^{-1})$. This allows us to obtain a bound on $\mathbb{E}(\lambda^X)$ and hence prove Theorem 3.1.

Lemma 3.3. Let $G = (V, E)$ be a multigraph with $n$ vertices, $m$ edges, and maximum degree $\Delta$. We can find $\Delta$ forests $F_1, \ldots, F_\Delta$ on the vertex set $V$ such that each $F_i$ has $\lceil m/\Delta \rceil$ or $\lfloor m/\Delta \rfloor$ edges and the edges of $F_1, \ldots, F_\Delta$ form a partition of $E$.

Proof. We begin by disregarding the condition that the forests should have almost equal size. In this case, we can decompose $E$ into $\Delta$ forests by iteratively removing a maximal acyclic set of edges $F_i$ from $E_i = E \setminus (F_1 \cup \cdots \cup F_{i-1})$. By removing $F_i$ from $G_i = (V, E_i)$, we reduce the degree of every non-isolated vertex by at least one, and so, in particular, we reduce the maximum degree of $G_i$ by at least one. Thus, after at most $\Delta$ steps there are no edges remaining in the graph, giving a decomposition of $E$ into (the edge sets of) $\Delta$ forests, $F_1, \ldots, F_\Delta$.

We denote by $|F_i|$ the size of the forest $F_i$, that is, the number of edges in $F_i$. In order to make these forests of almost equal size, observe that if $|F_i| > |F_j| + 1$, then $F_i$ has fewer components than $F_j$, so $F_i$ has at least one edge that connects two components of $F_j$. Removing this edge from $F_i$ and adding it to $F_j$ keeps both $F_i$ and $F_j$ acyclic, but reduces the imbalance in their sizes. Iteratively applying this operation to any pair of forests whose sizes differ by more than one eventually results in all forests being of size $\lceil m/\Delta \rceil$ or $\lfloor m/\Delta \rfloor$. □
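The two stages of this proof translate directly into a procedure. The sketch below is one possible implementation under stated assumptions: vertices are labelled $0, \ldots, n-1$, edges are given as a list of pairs (repeats allowed, loops not allowed), and a union-find structure is used to track components. None of the function names come from the paper.

```python
def _find(parent, x):
    """Union-find root lookup with path compression."""
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x

def maximal_forest(n, edges):
    """Greedily split edges into a maximal acyclic set and the leftover edges."""
    parent = list(range(n))
    forest, leftover = [], []
    for (u, v) in edges:
        ru, rv = _find(parent, u), _find(parent, v)
        if ru != rv:
            parent[ru] = rv
            forest.append((u, v))
        else:
            leftover.append((u, v))
    return forest, leftover

def balanced_forest_decomposition(n, edges, max_degree):
    """Partition the edges into max_degree forests of near-equal size."""
    forests, remaining = [], list(edges)
    for _ in range(max_degree):
        forest, remaining = maximal_forest(n, remaining)
        forests.append(forest)
    assert not remaining, "max_degree is smaller than the true maximum degree"
    while True:
        forests.sort(key=len)
        small, large = forests[0], forests[-1]
        if len(large) - len(small) <= 1:
            return forests
        # Components of the smaller forest.
        parent = list(range(n))
        for (u, v) in small:
            parent[_find(parent, u)] = _find(parent, v)
        # Some edge of the larger forest joins two of those components.
        for idx, (u, v) in enumerate(large):
            if _find(parent, u) != _find(parent, v):
                small.append(large.pop(idx))
                break
        else:
            raise RuntimeError("no exchangeable edge found")

# Hypothetical example: the complete graph K4 (m = 6, maximum degree 3).
k4_edges = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]
print([len(f) for f in balanced_forest_decomposition(4, k4_edges, 3)])
```

Balancing only the smallest and largest forest at each step suffices: once their sizes differ by at most one, every forest has $\lceil m/\Delta \rceil$ or $\lfloor m/\Delta \rfloor$ edges.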

Lemma 3.4. Let $F = (V, E)$ be a forest and let $\sigma$ be a uniformly random $q$-colouring of $V$. Let $X$ be the number of monochromatic edges of $F$. Then $X \sim \mathrm{Bin}(m, q^{-1})$, where $m$ is the number of edges in $F$.

Proof. It is sufficient to consider the case when $F$ is a tree. For if not, then we can consider the components of $F$ independently, and use the fact that the sum of $t$ independent binomial random variables of the form $\mathrm{Bin}(m_j, p)$ is a binomial random variable $\mathrm{Bin}(m_1 + \cdots + m_t, p)$.

Now assume that $F$ is a tree, and root $F$ at a vertex $v_0$. Let $v_0, \ldots, v_{n-1}$ be any ordering of the vertices in $V$ such that for every $i \ge 1$, the parent of $v_i$ is a member of $\{v_0, \ldots, v_{i-1}\}$. We generate a uniformly random $q$-colouring of $V$ by colouring each vertex with a uniformly random colour from $[q]$, independently, in the specified order. Each vertex has probability $1/q$ of being given the same colour as its parent, independently of all previous choices, and hence each edge has probability $1/q$ of being monochromatic, independently of all previous choices. Therefore the total number of monochromatic edges satisfies $X \sim \mathrm{Bin}(m, q^{-1})$. □
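Lemma 3.4 is easy to confirm on small examples by enumerating all colourings. The following sketch computes the exact distribution of the number of monochromatic edges and compares it with the binomial probabilities; the function names and the example graphs are illustrative assumptions.

```python
from fractions import Fraction
from itertools import product
from math import comb

def monochromatic_edge_distribution(n_vertices, edges, q):
    """Exact distribution of the number of monochromatic edges under a
    uniformly random q-colouring of vertices 0..n_vertices-1."""
    counts = [0] * (len(edges) + 1)
    for colouring in product(range(q), repeat=n_vertices):
        mono = sum(1 for (u, v) in edges if colouring[u] == colouring[v])
        counts[mono] += 1
    total = q ** n_vertices
    return [Fraction(k, total) for k in counts]

def binomial_pmf(m, p):
    """Probabilities of Bin(m, p) as exact fractions."""
    return [comb(m, i) * p**i * (1 - p)**(m - i) for i in range(m + 1)]

# Hypothetical example: a star with three edges, coloured with q = 3 colours.
star_edges = [(0, 1), (0, 2), (0, 3)]
print(monochromatic_edge_distribution(4, star_edges, 3) == binomial_pmf(3, Fraction(1, 3)))
```

The acyclicity assumption is needed: for a triangle with $q = 2$ the number of monochromatic edges is always 1 or 3, which is not a binomial distribution.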

We will also need the following result, which follows from a generalization of Hölder's inequality.

Lemma 3.5. Let $(X_1, \ldots, X_d)$ be a random, $\mathbb{R}^d$-valued vector, and suppose there exists a random variable $X$ such that $X_i \sim X$ for all $i = 1, \ldots, d$. Then for all $\lambda > 0$ we have

\[ \mathbb{E}(\lambda^{X_1 + \cdots + X_d}) \le \mathbb{E}(\lambda^{dX}). \]

Proof. Let $Z_j = \lambda^{X_j}$ and $p_j = d$ for $j = 1, \ldots, d$. Then the result follows from the generalised Hölder's inequality, which states that

\[ \mathbb{E}\Bigl(\prod_{j=1}^{d} |Z_j|\Bigr) \le \prod_{j=1}^{d} \bigl(\mathbb{E}|Z_j|^{p_j}\bigr)^{1/p_j} \]

for any random variables $Z_1, \ldots, Z_d$ and any $p_j \ge 1$ such that $\sum_{j=1}^{d} 1/p_j = 1$. (See for example [5].) □

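We are now ready to prove Theorem 3.1.

Proof of Theorem 3.1. By Lemma 3.3 we can decompose the edges of $G$ into $\Delta$ forests $F_1, \ldots, F_\Delta$, such that $m_i$, the number of edges in $F_i$, is either $\lceil m/\Delta \rceil$ or $\lfloor m/\Delta \rfloor$.

Let $\sigma$ be a uniformly random $q$-colouring of $V$, and let $X_i$ be the number of monochromatic edges of $F_i$ in the colouring $\sigma$. We know by Lemma 3.4 that $X_i \sim \mathrm{Bin}(m_i, q^{-1})$. Then $\mu(\sigma)$, the number of monochromatic edges of $G$ in $\sigma$, is given by $\mu(\sigma) = X_1 + \cdots + X_\Delta$ and

\[ Z(G, \lambda, q) = q^n\, \mathbb{E}(\lambda^{\mu(\sigma)}) = q^n\, \mathbb{E}(\lambda^{X_1 + \cdots + X_\Delta}). \]

For each $i = 1, \ldots, \Delta$, choose $Y_i \sim \mathrm{Bin}(\lceil m/\Delta \rceil, q^{-1})$ such that $\mathbb{P}(Y_i \ge X_i) = 1$. Then using the above and Lemma 3.5 we have

\begin{align*}
Z(G, \lambda, q) &= q^n\, \mathbb{E}(\lambda^{X_1 + \cdots + X_\Delta}) \le q^n\, \mathbb{E}(\lambda^{Y_1 + \cdots + Y_\Delta}) \le q^n\, \mathbb{E}(\lambda^{\Delta Y_1}) \\
&= q^n \bigl(1 + q^{-1}(\lambda^{\Delta} - 1)\bigr)^{\lceil m/\Delta \rceil}.
\end{align*}

The last equality holds because $Y_1 \sim \mathrm{Bin}(\lceil m/\Delta \rceil, q^{-1})$, so

\[ \mathbb{E}(\lambda^{\Delta Y_1}) = \sum_{i=0}^{\lceil m/\Delta \rceil} \binom{\lceil m/\Delta \rceil}{i} q^{-i} (1 - q^{-1})^{\lceil m/\Delta \rceil - i} \lambda^{\Delta i} = \bigl(1 + q^{-1}(\lambda^{\Delta} - 1)\bigr)^{\lceil m/\Delta \rceil}. \]

Finally, it is easy to check that $Z(H(n, m, \Delta), \lambda, q) = q^n \bigl(1 + q^{-1}(\lambda^{\Delta} - 1)\bigr)^{m/\Delta}$ when $\Delta$ divides $m$. □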
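For small multigraphs both sides of Theorem 3.1 can be computed exactly. The sketch below evaluates the partition function as the sum of $\lambda^{\mu(\sigma)}$ over all colourings $\sigma$, as in the proof above, and compares it with the bound; the function names and the example graphs are illustrative, and exact rational arithmetic is used to avoid rounding.

```python
from fractions import Fraction
from itertools import product

def partition_function(n, edges, lam, q):
    """Sum over all q-colourings of lam^(number of monochromatic edges)."""
    total = 0
    for sigma in product(range(q), repeat=n):
        mono = sum(1 for (u, v) in edges if sigma[u] == sigma[v])
        total += lam ** mono
    return total

def theorem_3_1_bound(n, m, max_degree, lam, q):
    """(1 + (lam^Delta - 1)/q)^ceil(m/Delta) * q^n."""
    exponent = -(-m // max_degree)  # integer ceiling of m / max_degree
    return (1 + (lam ** max_degree - 1) / q) ** exponent * q ** n

lam, q = Fraction(2), 3
# H(4, 6, 3): two independent edges, each replaced by 3 parallel edges.
h_edges = [(0, 1)] * 3 + [(2, 3)] * 3
# A 4-cycle, which has n = 4, m = 4 and maximum degree 2.
cycle_edges = [(0, 1), (1, 2), (2, 3), (3, 0)]
print(partition_function(4, h_edges, lam, q), theorem_3_1_bound(4, 6, 3, lam, q))
print(partition_function(4, cycle_edges, lam, q), theorem_3_1_bound(4, 4, 2, lam, q))
```

In this example the extremal graph attains the bound exactly, while the 4-cycle falls strictly below it.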



4 Slow mixing 

We have seen in Section 2.2 that for general graphs with maximum degree $\Delta$, the Glauber dynamics mixes rapidly if $q > \Delta \lambda^{\Delta} + 1$. Some improvements on this were given in Section 2.5. In this section, we shall see that these general bounds cannot be improved by much (in terms of the exponent of $\lambda$). We give a bound on the number of colours below which Glauber dynamics almost surely mixes slowly for a uniformly random $\Delta$-regular graph.

The technical tool used for most slow-mixing proofs is conductance. We now introduce the necessary definitions, following [7]. Again, $\mathcal{M}$ is a Markov chain with state space $\Omega$, transition matrix $P$ and stationary distribution $\pi$. For $A, B \subseteq \Omega$, define

\[ Q_{\mathcal{M}}(A, B) = \sum_{x \in A,\, y \in B} \pi(x) P(x, y). \]

We define

\[ \Phi_{\mathcal{M}}(A) = \frac{Q_{\mathcal{M}}(A, \bar A)}{\pi(A)\pi(\bar A)}, \]

where $\bar A := \Omega \setminus A$. Finally, we define the conductance of $\mathcal{M}$ as

\[ \Phi_{\mathcal{M}} := \min_{A \subset \Omega} \Phi_{\mathcal{M}}(A). \]

We drop the subscript when the Markov chain is clear from the context. Recall that $\tau(\mathcal{M}) = \tau(\mathcal{M}, \frac{1}{2e})$. Conductance gives a lower bound for the mixing time of a Markov chain via the following result.

Theorem 4.1. [3, Theorem 17] Let $\mathcal{M}$ be an ergodic Markov chain with transition matrix $P$, stationary distribution $\pi$ and conductance $\Phi_{\mathcal{M}}$. Then

\[ \tau(\mathcal{M}) \ge \frac{e - 1}{2e\, \Phi_{\mathcal{M}}}. \]

Suppose now that $G = (V, E)$ is an $n$-vertex graph, $\lambda > 1$ is given, and $q$ is a number of colours. By Theorem 4.1, in order to show that $\mathcal{M} = \mathcal{M}_{\mathrm{GD}}(G, \lambda, q)$ mixes slowly, it is sufficient to show that its conductance $\Phi_{\mathcal{M}}$ is exponentially small in $n$.
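For a chain with only a handful of states, the conductance defined above can be computed by brute force over all nonempty proper subsets. The sketch below does this for a generic chain given by a transition matrix and its stationary distribution; the function name and the two-state example are illustrative assumptions, not taken from the paper.

```python
from fractions import Fraction
from itertools import combinations

def conductance(P, pi):
    """Minimum over nonempty proper subsets A of Q(A, A^c) / (pi(A) * pi(A^c)).

    P: square transition matrix as a list of rows; pi: stationary distribution.
    """
    states = range(len(pi))
    best = None
    for size in range(1, len(pi)):
        for A in combinations(states, size):
            A_set = set(A)
            flow = sum(pi[x] * P[x][y]
                       for x in A_set for y in states if y not in A_set)
            pi_A = sum(pi[x] for x in A_set)
            value = flow / (pi_A * (1 - pi_A))
            best = value if best is None else min(best, value)
    return best

# Hypothetical two-state chain with jump probabilities a = 1/10 and b = 1/5.
a, b = Fraction(1, 10), Fraction(1, 5)
P = [[1 - a, a], [b, 1 - b]]
pi = [b / (a + b), a / (a + b)]
print(conductance(P, pi))  # equals a + b for this chain
```

The enumeration is exponential in the number of states, so it is only practical for very small chains; the arguments in this section bound the conductance analytically instead, by exhibiting a single bad set.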

We will need some more definitions. For $i \in [q]$ and $\sigma \in \Omega$, define

\[ \sigma_i = |\{v \in V : \sigma(v) = i\}|. \]

Next, define the $r$-shell and $r$-ball around a colour $i$ as follows:

\[ S_r(i) = \{\sigma : \sigma_i = n - r\}, \qquad B_r(i) = \{\sigma : \sigma_i \ge n - r\}. \]

We see that $B_r(i)$ is the set of colourings at distance at most $r$ from the all-$i$ colouring, and $S_r(i)$ is the set of colourings at distance exactly $r$ from the all-$i$ colouring. To simplify notation, we write $B_r = B_r(1)$ and $S_r = S_r(1)$ for the $r$-ball and $r$-shell around colour 1.

For an $n$-vertex graph $G = (V, E)$ and a positive integer $r$ satisfying $r < n/2$, we define

\[ \alpha_r(G) = \frac{1}{r} \max_{S \subseteq V,\ |S| = r} e_G(S), \]

where $e_G(S)$ is the number of edges of $G$ inside $S$. This quantity is low when the edge-expansion of $r$-vertex subgraphs of $G$ is high. We now establish a uniform bound on the conductance of $\mathcal{M}_{\mathrm{GD}}(G, \lambda, q)$ which holds when $\alpha_r(G)$ and $q$ are sufficiently small.

Lemma 4.2. Let $\lambda > 1$ and let $\Delta \ge 2$ be an integer. Fix $\kappa \in (1, \frac{\Delta}{2}]$ and let $\beta \in (0, 1)$. Suppose that $n \ge \beta^{-1}(2 + \Delta \log_2 \lambda)$ is an integer and let $r = \lfloor \beta n \rfloor$. Let $G$ be a $\Delta$-regular, $n$-vertex graph such that $\alpha_r(G) \le \kappa$. Finally, suppose that $q \ge 2$ is an integer which satisfies

(16)

Then the conductance of the Markov chain $\mathcal{M} = \mathcal{M}_{\mathrm{GD}}(G, \lambda, q)$ is bounded by

\[ \Phi_{\mathcal{M}} \le \frac{2}{\sqrt{2\pi r}}\, 2^{-r}. \tag{17} \]

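Proof. We bound $\Phi_{\mathcal{M}}$ by estimating $\Phi_{\mathcal{M}}(B_r)$. Let $P$ be the transition matrix for $\mathcal{M}$ and let $\pi$ be the stationary distribution of $\mathcal{M}$ (that is, the Gibbs distribution). We have

\begin{align*}
\Phi_{\mathcal{M}} \le \Phi_{\mathcal{M}}(B_r) &= \frac{\sum_{x \in B_r,\, y \in \bar B_r} \pi(x) P(x, y)}{\pi(B_r)\pi(\bar B_r)} = \frac{\sum_{x \in S_r,\, y \in \bar B_r} \pi(x) P(x, y)}{\pi(B_r)\pi(\bar B_r)} \\
&\le \frac{\pi(S_r)}{\pi(B_r)\pi(\bar B_r)} \le \frac{2\pi(S_r)}{\pi(B_r)},
\end{align*}

where the last inequality follows because $\pi(\bar B_r) \ge \frac{1}{2}$ (assuming that $q \ge 2$).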

Let $Z = Z(G, \lambda, q)$ be the partition function and write $m = \Delta n/2$ for the number of edges in $G$. Now $\pi(B_r) \ge Z^{-1} \lambda^m$ since the all-1 colouring belongs to $B_r$. Next we obtain an upper bound on $\pi(S_r)$.

Suppose that $A \subseteq V$ with $|A| = r$. Writing $E(A)$ for the set of edges of $G$ inside $A$, we know that $|E(A)| \le \alpha_r(G)\, r \le \kappa r$, and hence

\begin{align*}
|E(\bar A)| &= m - |E(A, \bar A)| - |E(A)| = m - (\Delta r - 2|E(A)|) - |E(A)| \\
&= m - \Delta r + |E(A)| \\
&\le m - (\Delta - \kappa) r.
\end{align*}

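Therefore

\begin{align*}
\pi(S_r) = Z^{-1} \sum_{\sigma \in S_r} \lambda^{\mu(\sigma)} &= Z^{-1} \sum_{A \subseteq V:\, |A| = r} \lambda^{|E(\bar A)|} \cdot Z(G[A], \lambda, q - 1) \\
&\le Z^{-1} \sum_{A \subseteq V:\, |A| = r} \lambda^{m - (\Delta - \kappa) r} \cdot Z(G[A], \lambda, q - 1) \\
&\le Z^{-1} \binom{n}{r} \lambda^{m - (\Delta - \kappa) r} \cdot Z((r, \lceil \kappa r \rceil, \Delta), \lambda, q - 1).
\end{align*}

The final inequality uses the fact that when $\lambda > 1$, the partition function is nondecreasing under the addition of edges. Combining these bounds shows that

\[ \Phi_{\mathcal{M}} \le 2 \binom{n}{r} \lambda^{-(\Delta - \kappa) r} \cdot Z((r, \lceil \kappa r \rceil, \Delta), \lambda, q - 1). \tag{18} \]

Using Corollary 3.2 we have

\begin{align*}
Z((r, \lceil \kappa r \rceil, \Delta), \lambda, q - 1) &\le \bigl(1 + (q - 1)^{-1} \lambda^{\Delta}\bigr)^{\lceil \kappa r/\Delta \rceil} (q - 1)^r \\
&\le \bigl(2 (q - 1)^{-1} \lambda^{\Delta}\bigr)^{\lceil \kappa r/\Delta \rceil} (q - 1)^r \\
&\le 2\lambda^{\Delta} \bigl(2 (q - 1)^{-1} \lambda^{\Delta}\bigr)^{\kappa r/\Delta} (q - 1)^r \\
&\le \bigl(4 \lambda^{\kappa} (q - 1)^{1 - \kappa/\Delta}\bigr)^r.
\end{align*}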

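Here the second inequality uses the fact that $q - 1 \le \lambda^{\Delta}$ (which follows from (16)), and the final inequality follows since $\kappa/\Delta \le \frac{1}{2}$ as well as the fact that $2^r \ge 2\lambda^{\Delta}$ (by our choice of sufficiently large $n$). Substituting this into (18) and applying the well-known inequality

\[ \binom{n}{r} \le \frac{n^r}{r!} \le \frac{1}{\sqrt{2\pi r}} \Bigl(\frac{en}{r}\Bigr)^r \]

gives

\[ \Phi_{\mathcal{M}} \le \frac{2}{\sqrt{2\pi r}} \Bigl(\frac{4en}{r}\, \lambda^{-(\Delta - 2\kappa)} (q - 1)^{(\Delta - \kappa)/\Delta}\Bigr)^r. \]

Now raising both sides of (16) to the power $(\Delta - \kappa)/\Delta$ and rearranging shows that

\[ \frac{4en}{r}\, \lambda^{-(\Delta - 2\kappa)} (q - 1)^{(\Delta - \kappa)/\Delta} \le \frac{1}{2}. \]

Therefore $\Phi_{\mathcal{M}} \le \frac{2}{\sqrt{2\pi r}}\, 2^{-r}$, as claimed. □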

Let $\mathcal{G}_{n,\Delta}$ denote the uniform probability space of all $\Delta$-regular graphs on the vertex set $[n] = \{1, 2, \ldots, n\}$, restricting to $n$ even if $\Delta$ is odd. That is, "$G \in \mathcal{G}_{n,\Delta}$" means that $G$ is a uniformly chosen $\Delta$-regular graph on the vertex set $[n]$. In a sequence of probability spaces indexed by $n$, an event holds asymptotically almost surely (a.a.s.) if the probability that the event holds tends to 1 as $n \to \infty$.

Next, given $\kappa$ we show how to choose $r$ in order to ensure that with high probability, a random $\Delta$-regular graph $G$ satisfies $\alpha_r(G) \le \kappa$.

Lemma 4.3. Fix $\Delta \ge 3$ and let $\kappa \in (1, \frac{\Delta}{2}]$. Let

(19)

and for each positive integer $n \ge \beta^{-1}$, define $r = r(n) = \lfloor \beta n \rfloor$, which is a positive integer. Let $G \in \mathcal{G}_{n,\Delta}$. Then a.a.s. $\alpha_r(G) \le \kappa$.

Proof. We use the configuration model of Bollobás [2] to construct random regular graphs. In this model, to construct a random $\Delta$-regular graph on $n$ vertices, we take $n$ sets (called buckets) each containing $\Delta$ labelled objects called points. Then we take a random partition $P$ of the $\Delta n$ points into $\Delta n/2$ pairs, where each pair is a set of two distinct points. We call $P$ a pairing. By replacing each bucket by a vertex and replacing each pair by an edge between the two corresponding vertices, we obtain a multigraph $G(P)$, which may have loops and multiple edges. If $G(P)$ is simple then it is $\Delta$-regular. It has been shown [2] that a random pairing is simple with probability tending to $\exp\bigl(-\frac{\Delta^2 - 1}{4}\bigr)$ as $n \to \infty$.

Let $m(2a)$ denote the number of pairings of $2a$ points. It is well known that

\[ m(2a) = \frac{(2a)!}{a!\, 2^a}. \]
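The pairing count can be confirmed for small $a$ by direct recursion: the first point is matched with one of the others and the rest are paired recursively. This is a minimal sketch with illustrative function names.

```python
from math import factorial

def count_pairings(points):
    """Count the ways to partition a list of distinct points into pairs."""
    if not points:
        return 1
    rest = points[1:]
    # Pair the first point with rest[i], then pair up everything else.
    return sum(count_pairings(rest[:i] + rest[i + 1:]) for i in range(len(rest)))

def pairings_formula(a):
    """The closed form (2a)! / (a! * 2^a)."""
    return factorial(2 * a) // (factorial(a) * 2 ** a)

for a in range(1, 5):
    print(a, count_pairings(list(range(2 * a))), pairings_formula(a))
```

The recursion returns 0 for an odd number of points, since one point is always left unmatched.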

Write $[x]_a = x(x - 1) \cdots (x - a + 1)$ to denote the falling factorial. Now let $\mathcal{P}_{n,\Delta}$ denote the uniform probability space on the set of pairings with $n$ buckets, each containing $\Delta$ points. Let $B$ be a fixed set of $r$ buckets. Given a positive integer $s$, let $m_B(r, s)$ be the number of pairings in $\mathcal{P}_{n,\Delta}$ in which at least $s$ pairs are contained in $B$. We can obtain an overcount of $m_B(r, s)$ in the following way. We first select $s$ pairs within $B$, in

\[ \frac{[\Delta r]_{2s}}{s!\, 2^s} \]

ways. Then we pair up the remaining $\Delta n - 2s$ points in $m(\Delta n - 2s)$ ways. Hence

\[ m_B(r, s) \le \frac{[\Delta r]_{2s}}{s!\, 2^s} \cdot \frac{(\Delta n - 2s)!}{(\Delta n/2 - s)!\, 2^{\Delta n/2 - s}} = \frac{(\Delta r)!\, (\Delta n - 2s)!}{2^{\Delta n/2}\, s!\, (\Delta r - 2s)!\, (\Delta n/2 - s)!}. \]

Therefore the probability $p(r, s)$ that a random pairing in $\mathcal{P}_{n,\Delta}$ has at least $s$ pairs within $B$ is

\[ p(r, s) = \frac{m_B(r, s)}{m(\Delta n)} \le \binom{\Delta n/2}{s} \frac{[\Delta r]_{2s}}{[\Delta n]_{2s}} \le \binom{\Delta n/2}{s} \Bigl(\frac{r}{n}\Bigr)^{2s}. \]
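Let $X(r, s)$ be the random variable which counts the number of sets of $r$ buckets which contain at least $s$ pairs of $P$, for $P \in \mathcal{P}_{n,\Delta}$. Using the inequality $\binom{a}{b} \le (ea/b)^b$, we have

\[ \mathbb{E}(X(r, s)) = \binom{n}{r} p(r, s) \le \binom{n}{r} \binom{\Delta n/2}{s} \Bigl(\frac{r}{n}\Bigr)^{2s} \le \Bigl(\frac{en}{r}\Bigr)^r \Bigl(\frac{\Delta e r^2}{2sn}\Bigr)^s. \]

Now fix $s = \lceil \kappa r \rceil$ where, recall, $r = \lfloor \beta n \rfloor$. By definition of $\beta$ we have $\Delta e r \le 2\kappa n$, and hence

\[ \mathbb{E}(X(r, \lceil \kappa r \rceil)) \le \Bigl(\frac{ne}{r}\Bigr)^r \Bigl(\frac{\Delta e r}{2\kappa n}\Bigr)^{\kappa r} \le \bigl((2\kappa)^{-\kappa} e^{\kappa + 1} \Delta^{\kappa} \beta^{\kappa - 1}\bigr)^r. \]

When (19) holds, we see that

\[ (2\kappa)^{-\kappa} e^{\kappa + 1} \Delta^{\kappa} \beta^{\kappa - 1} \le 2^{-(\kappa - 1)} \]

and this upper bound is a constant in $(0, 1)$ which is independent of $n$. Since $r \ge \beta n - 1$ it follows that $\mathbb{E}(X(r, \lceil \kappa r \rceil)) = o(1)$, and we conclude that

\[ \mathbb{E}\bigl(X(r, \lceil \kappa r \rceil) \mid G(P) \text{ is simple}\bigr) \le \frac{\mathbb{E}(X(r, \lceil \kappa r \rceil))}{\mathbb{P}(G(P) \text{ is simple})} = o(1). \]

This shows that when (19) holds, a.a.s. $G \in \mathcal{G}_{n,\Delta}$ has the property that all subsets of vertices of size $r$ have fewer than $\kappa r$ edges. □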






Now we can easily show that when $q$ is sufficiently small and $n$ is sufficiently large, the mixing time of the Glauber dynamics is slow for almost all $\Delta$-regular graphs.

Theorem 4.4. Fix $\Delta \ge 3$ and let $\kappa \in (1, \frac{\Delta}{2}]$. Suppose that $\beta$ is defined by (19) and let $q \ge 2$ be an integer which satisfies (16). Let $G \in \mathcal{G}_{n,\Delta}$. Then a.a.s. the Glauber dynamics $\mathcal{M} = \mathcal{M}_{\mathrm{GD}}(G, \lambda, q)$ satisfies

\[ \tau(\mathcal{M}) \ge 2^{\beta n - 4}. \]

Proof. For each positive integer $n \ge \beta^{-1}(2 + \Delta \log_2 \lambda)$, let $r = r(n) = \lfloor \beta n \rfloor$, which is a positive integer. By Lemma 4.3 we know that a.a.s. $G \in \mathcal{G}_{n,\Delta}$ satisfies $\alpha_r(G) \le \kappa$. Hence a.a.s. the conductance of the corresponding Glauber dynamics $\mathcal{M}_{\mathrm{GD}}(G, \lambda, q)$ is bounded above by

\[ \frac{2}{\sqrt{2\pi r}}\, 2^{-r} \]

by Lemma 4.2. Applying Theorem 4.1 completes the proof. □

We conclude this section by proving Theorem 1.2 and Theorem 1.3.

Proof of Theorem 1.2. (i) Given $\eta \in (0, 1)$, let $k = \lceil \eta^{-1} \rceil$ and define $c_1 = 2^{k+1} (\Delta k)^{2k}$. If $q > c_1 \lambda^{\Delta - 1 + \eta}$ then $q > c_1 \lambda^{\Delta - 1 + 1/k}$, by choice of $k$. Then the conclusion follows from Theorem 2.14.

For (ii), given $\eta \in (0, 1)$ define $\kappa = 1 + \eta/5$. Since $\Delta \ge 3$ we have $\kappa \in (1, \frac{6}{5}) \subseteq (1, \frac{\Delta}{2}]$. Define



AV 2(1+ ^ 
2k 



° 2 1024 c 

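By our choice of $\kappa$ and since $\Delta \ge 3$, we have

\[ \kappa + \frac{\kappa^2}{\Delta - \kappa} \le 1 + \frac{2}{\Delta - 1} + \eta. \]

Therefore, if

\[ q - 1 \le c_2\, \lambda^{\Delta - 1 - \frac{2}{\Delta - 1} - \eta} \]

then (16) holds, and the result follows by applying Theorem 4.4. □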

Proof of Theorem 1.3. The first and third statement follow from substituting $\Delta = 4$ into Theorem 1.2 (i) and (ii), respectively. (So $c_3$ is obtained by substituting $\Delta = 4$ in $c_1$, and $c_5$ is obtained from $c_2$ similarly.)

For (ii), let $k = \lceil 2\eta^{-1} \rceil$ and define $c_4 = (8k - 1)\, 2^{k^2 + 8k} k^{2k^2 + 4k}$. If $q > c_4 \lambda^{2 + \eta}$ then $q > c_4 \lambda^{2 + 2/k}$, by definition of $k$. Then Theorem 2.15 applies, completing the proof. □